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In  recent  years  a  great  deal  of  attention  has  been  focused  on  answering 
questions  regarding  the  cover  times  of  random  walks  on  simple  graphs.  In 
this  dissertation  we  answer  questions  about  the  edge  cover  time  of  such  walks. 

We  begin  by  reviewing  many  of  the  definitions  and  established  results  for 
vertex  cover  time.  We  present  the  little  that  has  been  published  regarding 
bounds  on  the  edge  cover  time.  Having  completed  this  review  we  establish  a 
new,  and  sometimes  more  useful,  global  upper  bound  for  edge  cover  time. 

We  then  narrow  our  focus  and  consider  the  edge  cover  time  of  the  path. 
We  establish  an  exact  description  of  the  edge  cover  time  for  a  random  walk 
on  the  path  started  at  an  endpoint  in  terms  of  coefficients  related  to  the 
Bernoulli  Numbers  of  the  Second  Kind.  Studying  these  coefficients  carefully 
allows  us  to  develop  a  tight  bound  on  this  cover  time  of  (n  — 1)^+0  (n^/  log  n). 
Using  these  results,  and  generalizing,  provides  a  description  of  the  edge  cover 
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time  for  walks  on  the  path  started  from  an  arbitrary  vertex.  This  general¬ 
ization  gives  us  a  bound  of  (5/4)(n  —  1)^  +  0{v? /\ogn)  for  the  edge  cover 
time  for  the  path. 

Having  established  a  tight  bound  for  walks  on  paths  we  then  focus  on 
other  trees.  We  prove  that  the  edge  cover  time  for  a  random  walk  started 
from  the  center  of  a  star  graph  minimizes  edge  cover  time  for  walks  on  all 
trees  on  n  vertices.  We  also  establish  the  fact  that  in  all  graphs  the  edge 
cover  time  for  a  walk  started  from  a  leaf  is  always  greater  than  for  a  walk 
started  from  its  point  of  attachment.  We  continue  our  study  of  trees  by 
establishing  a  global  upper  bound  on  the  edge  cover  time  for  all  trees  and 
use  it  to  study  balanced  k-ary  trees. 

Finally  we  show  the  connection  between  our  previous  developments  for 
the  edge  cover  time  for  paths  and  that  of  the  edge  cover  time  on  the  cycle. 
This  leads  to  a  tight  bound  for  the  undirected  and  directed  edge  cover  times 
of  this  graph. 
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Summary 


In  recent  years  a  great  deal  of  attention  has  been  focused  on  answering  ques¬ 
tions  regarding  the  cover  times  of  random  walks  on  simple  graphs.  In  this 
dissertation  we  answer  questions  about  the  edge  cover  time  of  such  walks. 

We  begin  by  reviewing  many  of  the  definitions  and  established  results  for 
vertex  cover  time.  We  present  the  little  that  has  been  published  regarding 
bounds  on  the  edge  cover  time.  Having  completed  this  review  we  establish  a 
new,  and  sometimes  more  useful,  global  upper  bound  for  edge  cover  time. 

We  then  narrow  our  focus  and  consider  the  edge  cover  time  of  the  path. 
We  establish  an  exact  description  of  the  edge  cover  time  for  a  random  walk 
on  the  path  started  at  an  endpoint  in  terms  of  coefficients  related  to  the 
Bernoulli  Numbers  of  the  Second  Kind.  Studying  these  coefficients  carefully 
allows  us  to  develop  a  tight  bound  on  this  cover  time  of  (n— l)^+©(n^/  logn). 
Using  these  results,  and  generalizing,  provides  a  description  of  the  edge  cover 
time  for  walks  on  the  path  started  from  an  arbitrary  vertex.  This  general¬ 
ization  gives  us  a  bound  of  (5/4) (n  —  1)^  +  0{n^ /\ogn)  for  the  edge  cover 
time  for  the  path. 
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Having  established  a  tight  bound  for  walks  on  paths  we  then  focus  on 
other  trees.  We  prove  that  the  edge  cover  time  for  a  random  walk  started 
from  the  center  of  a  star  graph  minimizes  edge  cover  time  for  walks  on  all 
trees  on  n  vertices,  paralleling  a  result  for  vertex  cover  time  on  trees.  In 
the  process  of  proving  this  we  also  establish  the  fact  that  in  all  graphs  the 
edge  cover  time  for  a  walk  started  from  a  leaf  is  always  greater  than  for  a 
walk  started  from  its  point  of  attachment.  We  continue  our  study  of  trees  by 
establishing  a  global  upper  bound  on  the  edge  cover  time  for  all  trees  and  use 
it  to  study  balanced  k-ary  trees.  Our  study  of  the  star  also  leads  to  alternate 
proofs  for  two  identities  involving  the  Stirling  Numbers  of  the  Second  Kind. 

Finally  we  show  the  connection  between  our  previous  developments  for 
the  edge  cover  time  for  paths  and  that  of  the  edge  cover  time  on  the  cycle. 
This  leads  to  a  tight  bound  for  the  undirected  and  directed  edge  cover  times 
of  this  graph. 


CHAPTER  I 


DEFINITIONS  AND  KNOWN 
RESULTS  FOR  VERTEX  COVER 

TIME 


1.1  Definitions 

A  graph,  G,  consists  of  a  pair  of  sets  {V,E}.  The  elements  of  the  set  V 
are  known  as  vertices.  The  elements  of  E  are  known  as  edges  and  consist 
of  unordered  pairs  of  the  elements  of  V.  In  this  thesis  we  will  consider  only- 
finite  simple  graphs,  so  V  will  be  a  finite  set,  and  E  will  contain  only  distinct 
pairs  from  V  in  which  each  element  of  the  pair  is  also  distinct.  We  will  follow 
common  convention  and  normally  indicate  the  size  of  the  vertex  set  by  n  and 
the  size  of  the  edge  set  by  m.  The  neighbors  of  a  vertex  x  are  the  elements 
of  the  set  {y  :  3{a:,  y}  G  E}.  The  degree  of  a  vertex  x,  d(x'),  is  the  cardinality 
of  the  neighbor  set  of  x.  We  will  also  discuss  the  directed  version  of  G  in 
which  we  replace  each  element,  {x,y},  of  the  set  E  by  two  ordered  pairs. 
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{x,y)  and  {y,x).  In  this  case  we  may  make  use  of  the  indeyree,  d-{x)  or 
outdegree,  d-^{x),  of  a  vertex  x.  However,  due  to  the  method  of  construction 
of  the  graph  it  is  clear  that  d+{x)  =  d-{x)  =  d{x).  We  will  rely  on  the  usual 
definitions  of  other  graph  concepts  such  as  whether  a  graph  is  connected, 
bipartite  or  acyclic,  etc.  A  standard  reference  for  these  concepts  is  Bondy 
and  Murty[5]. 

A  discrete  Markov  chain  is  a  stochastic  process  in  which 

P{Xn\Xn-l,Xn-2,  ...,Xo}  = 

and  this  probability  is  independent  of  n.  The  set  of  possible  outcomes  for  Xn 
is  known  as  the  state  space  of  the  chain.  We  will  denote  by  pij  the  probability 
P{Xn  =  j\Xn-i  =  i}.  If  P  =  Ipij]  is  the  state  transition  probability  matrix 
for  an  n  state  discrete  Markov  chain,  the  chain  is  irreducible  if  for  each  pair, 
{i,j},  of  elements  of  the  state  space  there  exists  some  integer,  fcpj},  such  that 
the  i,j  entry  of  is  nonzero.  The  probability  vector,  tt,  is  the  stationary 

distribution  for  an  irreducible  Markov  chain  if  Vj,  tTj  >  0,  =  1,  and 

ttP  =  tt.  a  Markov  chain  is  reversible  if  for  all  i  and  j  in  the  state  space 

A  random  walk  on  a  connected  graph  G  =  {V,E}  is  a  reversible,  irre¬ 
ducible  Markov  chain  with  state  space  V  and  for  which  =  l/d{x)  when 
there  is  an  edge  {x,y}  in  E  and  p^.y  =  0  otherwise.  One  may  think  of  this 
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as  placing  a  token  on  a  vertex  of  the  graph  and  then  observing  it  as  at  each 
time  step  it  moves  uniformly  at  random  to  some  neighbor  of  its  present  po¬ 
sition.  This  definition  may  be  generalized  to  cases  in  which  the  transition 
probabilities  at  each  vertex  are  not  uniform,  but  we  will  not  address  this  case. 
The  classical  example  of  a  random  walk  on  a  graph  is  the  “gambler’s  ruin 
problem.”  If  one  considers  a  fair  game,  i.e.  one  in  which  the  probabilities 
of  success  and  failure  are  the  same,  then  gambler’s  ruin  can  be  viewed  as 
placing  a  token  at  some  nonnegative  point  on  the  natural  number  line  and 
asking  how  long  before  the  token  will  reach  either  0  or  some  predetermined 
upper  limit  when  each  play  of  the  game  results  in  a  gain  or  loss  of  1  unit. 

Throughout  this  dissertation  we  will  discuss  random  variables  whose  value 
may  depend  on  the  starting  point  of  the  walk.  For  a  random  variable  X,  we 
will  denote  the  expected  value  of  X  for  a  walk  started  at  ver  tex  ^  by  E^X. 
The  hitting  time  from  x  to  y,  E^Hy,  is  the  expected  number  of  steps  it  will 
take  for  the  token  to  move  from  vertex  x  to  vertex  y.  Note  that  it  is  not 
generally  the  case  that  E^Hy  =  EyH^.  We  will  denote  by  Tf  the  maximum 
hitting  time  for  all  pairs  of  vertices,  i.e.  H  =  maXx,y^vExHy.  The  commute 
time  between  vertices  x  and  y  is  Cx,y  =  E^Hy  +  EyH^ 

Given  a  starting  vertex  x,  and  counting  this  placement  as  a  visit  to  x,  one 
may  ask:  What  is  the  expected  number  of  steps  needed  to  ensure  the  random 
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walk  has  visited  all  the  vertices  of  the  graph  at  least  once?  The  vertex  cover 
time,  C",  is  the  maximum  of  this  value  over  all  starting  points  x.  One  may 
modify  the  question  slightly  and  ask:  What  is  the  expected  number  of  steps 
to  visit  all  the  vertices  and  then  return  to  the  starting  point?  This  time  is 
called  the  vertex  cover  and  return  time. 

We  will  also  use  the  following  notation  to  describe  the  growth  of  a  func¬ 
tion.  When  /  and  g  are  nonnegative  valued  functions  we  define  f{n)  = 
0{g{n))  if  3  N  and  c,  9  Vn  >  N,f{n)  <  cg(n).  Likewise  /(n)  =  Q{g{n)) 
if  3  N  and  c,3  Vn  >  N,f{n)  >  cg{n).  Define  /(n)  =  Q{g{n))  if  /(n)  == 
Q.{g{n))  and  /(n)  =  0{g{n)).  Finally  /(n)  =  o{g{n))  if  limn_*co  f{n)/g{n)  = 
0. 

1.2  Results  for  Vertex  Cover  Time 

In  recent  years  a  good  deal  of  research  has  been  devoted  to  questions  regard¬ 
ing  the  hitting  and  vertex  cover  times  of  random  walks  on  graphs.  Unfortu¬ 
nately,  there  are  only  a  few  cases  where  the  quantities  are  easily  computed 
exactly.  The  following  well  known  recursive  argument  provides  the  hitting 
time  from  one  endpoint  to  the  other  of  a  path  with  n  vertices. 

Assume  that  the  vertices  of  the  path  are  labeled  in  the  natural  way  from  1 
to  n.  Define  Oj  to  be  the  expected  number  of  steps  needed,  starting  at 
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vertex  j,  to  first  reach  vertex  n.  Clearly  an  —  0.  Then  since  a  walk  started 
at  vertex  1  steps  to  vertex  2  with  probability  1  on  its  first  step  we  must  have 


ai  —  1  +  a2- 


Finally  for  1  <  j  <  n  we  have 

aj  =  -(1  +  aj-i)  +  -(1  +  rij+i). 

Now  we  have  a  second  order  recursion  with  two  boundary  conditions  whose 
solution  provides  the  closed  form  description 

aj  =  {n-ir-{j-ir. 

Thus  EiHn  =  in  —  1)^. 

Using  a  similar  recursive  argument  one  may  establish  that  the  vertex  cover 
time  for  a  walk  started  at  vertex  j  is  (5/4)(n—  1)^  —  (l/4)(n  —  2j  +  1)^.  This 
quantity  is  maximized  by  a  walk  started  at  the  center/centers  of  the  path, 
and  has  value  (5/4)  (n  —  1)^  there  when  n  is  odd.  Another  graph  for  which 
a  recursive  argument  provides  the  vertex  cover  time  is  the  cycle,  resulting  in 
^2)  for  the  n  vertex  cycle. 

A  further  simple  graph  one  may  consider  is  the  Star^  Sn,  consisting  of 
n  —  \  vertices  of  degree  1  and  a  single  vertex  of  degree  n  —  1.  In  this  case  a 
“coupon  collector”  type  argument  provides  the  exact  vertex  cover  time. 
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Consider  a  walk  on  Sn  started  at  a  leaf.  Then  one  may  consider  the 
leaves  of  the  star  to  be  coupons.  On  every  second  step  a  coupon  will  be 
chosen.  Upon  first  acquiring  the  new  coupon  consider  how  many  steps, 
on  average,  one  must  take  to  acquire  the  (_7  +  l)*h  Having  acquired  j  distinct 
coupons,  with  probability  {j  —  l)/(n  —  1)  one  takes  two  steps  and  returns 
to  a  leaf  already  visited,  otherwise,  with  probability  {n  —  j  —  l)/{n—  1)  one 
takes  two  steps  and  acquires  a  new  coupon.  Thus  for  j  >  1 ,  if  Cj  represents 
the  number  of  steps  to  acquire  the  (j  +  coupon,  we  have 

Cj  =  ^  ,  (2  +  Cj)  H - — — (2), 

n  —  i  n  —  1 


which  has  solution 


Cj  —  2 


n  —  1 

J^-3-  1, 


Then 


=  2(n  -  1)|;  =  2( 


=  2(n- 


where  hk  =  Y.  (l/j);  the  k,  harmonic  number.  It  is  easy  to  see  that  if  a 
j=i 

walk  starts  from  the  center  of  it  would  take  one  step  to  reach  a  leaf  and 
then  accomplish  a  vertex  cover  as  if  it  had  started  at  that  leaf.  Thus  for  Sn, 


—  2(77.  —  l)/l,i_2  +  1- 

Brightwell  and  Winkler[7]  established  that  the  vertex  cover  time  for  all 
trees  on  n  vertices  is  minimized  by  the  star.  They  conjectured  that  a  random 
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walk  on  the  path  when  started  from  the  center /centers  maximizes  vertex 
cover  time  for  trees,  but  this  has  yet  to  be  proved. 

Modifying  the  “coupon  collector”  argument  above  only  slightly  results 
in  an  easy  proof  that  the  vertex  cover  time  for  the  clique  on  n  vertices  is 
(n-  l)/in-l- 

Brightwell  and  Winkler[6]  determined  the  graph  which  maximizes  hitting 
time  among  all  graphs.  This  graph  is  known  as  the  “lollipop”  and  consists  of 
a  clique  on  approximately  2n/3  vertices  attached  to  a  path  on  the  remaining 
vertices.  They  proved  that  the  hitting  time  in  this  graph  is  maximized  by 
starting  at  a  vertex  in  the  clique  and  proceeding  to  the  tail  of  the  path. 
The  hitting  time  for  such  a  walk  is  (4/27  +  o(l))n®.  They  conjectured  that 
this  graph  also  maximized  vertex  cover  time  for  all  graphs.  Fiege[13]  has 
proved  that  the  maximum  vertex  cover  time  for  all  graphs  is  bounded  above 
by  (4/27)n^  +  lending  credence  to  that  conjecture. 

Since  there  are  few  graphs  for  which  it  has  been  possible  to  compute 
hitting  or  vertex  cover  times  exactly,  a  good  deal  of  work  has  been  done  to 
establish  bounds  on  these  quantities.  One  of  the  earliest  bounds  for  vertex 
cover  time  is  a  consequence  of  a  more  general  theorem  due  to  Matthews[20]. 
This  result  provides  an  upper  bound  on  the  vertex  cover  time  based  on  the 
maximum  hitting  time  of  the  graph. 
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Theorem  1.1  If  G  has  n  vertices  then 


C^<Hhn^H\ogn  (1.1) 

Matthews  developed  this  result  using  Markov  chain  theory  while  studying 
Brownian  motion  on  the  sphere.  While  it  proves  to  be  too  generous  in  some 
cases  there  are  a  number  of  “natural”  cases  in  which  it  is  tight.  One  such 
example  is  the  hypercube  for  which  Aldous[l]  proved  a  cover  time  bound  of 
0(nlogn).  Another  is  the  balanced  fc-ary  tree  for  which  Zuckerman  proved 
=  0(nlog^n). 

Doyle  and  Snell[ll]  make  an  elegant  connection  between  the  theory  of  re¬ 
sistor/capacitor  networks  and  random  walks  on  graphs.  Building  on  this 
foundation  Chandra  [9]  et  al.  prove  that  the  cover  time  for  d-regular  ex¬ 
pander  graphs  is  0(nlogn).  They  also  use  this  theory  to  establish  bounds  of 
0{n log^  n)  for  2  dimensional  toruses  and  0{n  log  n)  for  d  dimensional  toruses 
where  d>  2  and  n  —  for  some  integer  k.  Matthews  bound  is  tight  in  all 
these  cases  as  well. 

While  attempting  to  prove  the  existence  of  universal  traversal  sequences 
Aleliunas[3]  et  al.  established  perhaps  the  first  general  upper  bound  on  the 
vertex  cover  time. 

Theorem  1.2 

C^<2m{n-l).  (1.2) 


8 


Here,  els  usual,  n  denotes  the  number  of  vertices  in  the  graph,  and  m  the 
number  of  edges.  Their  technique  made  use  of  the  fact  that  the  commute  time 
between  any  pair  of  adjacent  vertices  in  a  graph  is  bounded  by  2m.  They 
observed  that  if  one  takes  any  spanning  tree  of  a  graph  and  any  specific 
vertex  x,  there  is  a  covering  walk  which  begins  and  ends  at  x  and  traverses 
each  edge  of  the  spanning  tree  exactly  once  in  each  direction.  Then  the  vertex 
cover  time  is  bounded  above  by  the  average  time  for  a  walk  to  cover  all  the 
vertices  of  the  graph  in  the  order  imposed  by  this  walk.  The  amount  of  time 
necessary  for  such  a  walk  is  just  the  sum  of  the  commute  times  for  each  pair 
of  adjacent  vertices  in  the  spanning  tree  which  is  bounded  by  2m(n  —  1). 

A  trivial  lower  bound  for  the  vertex  cover  time  of  all  graphs  is  n.  Fiege[14] 
has  proven  the  following  lower  bound  which  is  tight  for  several  classes  of 
graphs. 

C'’  >n\ogn.  (1.3) 

Other  authors  have  proved  bounds  on  the  vertex  cover  time  for  specific 
classes  of  graphs  using  the  properties  of  regularity,  the  average  or  maximum 
degree,  and  the  diameter  of  the  graph.  Aldous  has  collected  these,  and  much 
more  about  reversible  Markov  chains,  in  several  chapters  of  a  book[2]  yet  to 
be  published. 
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CHAPTER  II 


A  NEW  BOUND  ON  THE  EDGE 
COVER  TIME 

2.1  Definitions  and  Known  Results 

Given  the  interest  in  vertex  cover  time  it  is  only  natural  to  ask  the  question 
in  terms  of  edges  as  well.  That  is,  given  a  starting  point  x,  what  is  the 
expected  number  of  steps  necessary  to  ensure  that  the  token  has  traversed 
every  edge  of  G  at  least  once.  The  undirected  edge  cover  time,  C”®,  is  the 
maximum  over  all  starting  points  of  this  value.  The  directed  edge  cover  time, 
C®,  is  the  maximum  over  all  starting  points  for  this  value  when  one  asks  the 
question  in  terms  of  traversing  each  edge  of  G  in  both  directions.  In  this 
work  we  will  concentrate  on  answering  some  of  the  questions  related  to  these 
quantities. 

There  appears  to  be  relatively  little  in  the  literature  about  the  edge  cover 
time.  One  of  the  few  results  is  a  global  bound  due  to  Zuckerman[27]. 
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Theorem  2.1  The  directed  edge  cover  time  for  a  graph  G  is  0{nm). 

In  his  paper  Zuckerman  actually  establishes  two  upper  bounds.  The  first 
is  the  “natural”  bound  derived  from  Matthews  [20]  results.  It  is  easy  to  argue 
that  the  maximum  hitting  time  from  one  directed  edge  to  another  in  a  graph 
is  bounded  above  by  (2m  +  H).  Thus,  using  Matthews’  result,  and  the  fact 
that  the  sequence  of  directed  edges  traversed  during  a  random  walk  also  forms 
a  Markov  chain,  one  obtains  =  0{{rn  +  H)  log  m).  Zuckerman  points  out 
that  this  bound  is  tight  for  those  graphs  having  C"  =  ©(i^logn),  such  as 
cliques,  hypercubes,  and  k-dimensional  toruses.  However,  in  the  general  case 
this  natural  bound  yields  0{nm\ogn).  The  path  is  an  example  where  this 
fails  to  be  tight,  for  it  is  clear  that  the  edge  cover  time  for  a  path  is  bounded 
by  2n^,  while  this  bound  provides  only  0(n^  log  n). 

In  order  to  remove  the  extra  logn  factor  in  the  “natural”  bound  Zuck¬ 
erman  makes  use  of  a  renewal  type  argument  to  show  that  C®  =  0{k{C'"  + 
m^+^/^)),  for  any  integer  k.  Letting  A;  =  2  here  provides  the  0{mn)  bound. 
This  yields  bounds  of  O(n^)  for  the  path  and  0(n®)  for  the  “lollipop,”  which 
are  tight.  However,  this  also  yields  0(n“),  with  a  >  2,  for  the  clique.  This 
exceeds  the  “natural”  bound  of  O(n^logn),  which  is  also  tight.  The  fact 
that  neither  of  these  bounds  is  tight  in  all  known  cases  led  us  to  search  for 
another  means  to  bound  the  edge  cover  time. 
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2.2  A  New  Global  Upper  Bound 


Intuitively,  if  one  focuses  on  the  edges  emanating  from  a  specified  vertex  x, 
then  the  amount  of  time  necessary  to  traverse  all  the  edges  out  of  x  may  be 
found  by  a  “coupon  collector”  argument.  It  is  well  known  that  the  expected 
amount  of  time  to  collect  all  of  the  coupons  in  a  set  of  size  n  is  approximately 
nlogn.  Thus  it  seems  plausible  that  if  we  visit  vertex  x  a  constant  multiple 
of  d{x)\ogd(x)  times,  then  with  high  probability  we  should  have  traversed 
all  the  edges  out  of  x.  Now  if  we  consider  a  regular  graph  of  degree  d  with  n 
vertices  and  m  =  dn  directed  edges,  and  i/we  were  near  stationarity,  then, 
since  the  stationary  distribution  has  itj  =  d/m,\/j,  the  number  of  steps,  k, 


needed  would  be  given  by 


d  log  d=^  k  ^  m  log  d. 


Since  d  <  n  this  would  indicate  that  k  dnlogn  should  certainly  suffice  in 
such  cases.  The  question  becomes:  Is  there  a  way  to  quantify,  in  the  general 
case,  how  many  times  one  has  visited  each  vertex  on  a  “long”  walk  through 


the  graph? 


We  know  that  eventually  the  frequency  with  which  each  vertex  is  visited 
approaches  the  value  determined  by  the  stationary  distribution.  Winkler 
and  Zuckerman[25]  have  defined  the  blanket  time  of  a  graph,  to  be  the 
expected  time  for  a  random  walk  to  have  hit  every  vertex  of  G  within  a 
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multiplicative  constant  factor,  e  <  1,  of  the  number  of  times  predicted  by 
the  stationary  distribution.  Specifically,  letting  Nx{t)  be  the  number  of  times 
vertex  x  is  visited  by  time  t,  they  define  =  min{t ;  {\/x)Nx{t)  > 
and  set  =  maxxev  ExBe.  Clearly  =  Q.{C'").  They  conjecture  that 
Be  =  0(C'").  They  prove  their  conjecture  in  the  cases  of  the  path,  the  cycle, 
and  those  graphs  for  which  C"  =  ^(Blogn). 

Using  the  concept  of  Blanket  Time  and  letting  d  ~  {Ylxev  d{x))  /n,  the 
average  degree  of  a  vertex  in  the  graph,  we  now  claim: 

Theorem  2.2 

=  0{dB,).  (2.1) 


Proof 

Since  B^  is  an  expected  value,  we  know  from  Markov’s  Inequality  that 
Vc>  0, 

P{3x  e  K  9  NxicdB,)  <  (1  -  e)  TTxCdB,}  <  i.  (2.2) 

cd 

Thus  with  probability  1  —  l/cd  we  have  Vx  €  V,  Nx{cdBe)  >  (1  —  e)  nxcdB^. 

On  each  visit  to  a  vertex  x  the  probability  that  a  specified  directed  edge, 
say  {x,y),  is  not  traversed  on  the  next  step  of  the  walk  is 


Pr[edge  {x,y)  is  not  traversed] 


-1 

<  . 
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Therefore 


Pr[edge  {x,y)  is  not  traversed  after  q  visits  to  a;]  <  e^. 

So  in  those  cases  where,  after  taking  q  =  cdB^  steps(c  to  be  chosen  later), 
we  have  “blanketed”  we  have 

_  — (1— £)7ricJBe 

Pr[edge  {x,y)  is  not  traversed  after  alB^  steps]  <  e  ^  . 

Using  the  fact  that  =  d{x)/m  we  have 

~  1  — (1— e)cfrjBr; 

Pr[edge  (x^y)  is  not  traversed  after  cdB^  steps]  <  e  ^ 

Note  that  this  value  is  independent  of  the  particular  edge  we  are  considering 
so 

—  — (1— e)cr7Be 

Pr[3  an  edge  {x/y)  which  is  not  traversed  after  cdB^  steps]  <  me 
Using  the  fact  that  m  —  dn  we  have 

_  —  ^  —  — (1— c)cBg 

Pr[3  an  edge  {x,y)  which  is  not  traversed  after  cdB^  steps]  <  dne  ^ 

Combining  this  with  B^  >  C"  and  inequality (1.3)  yields 

Pr[3  an  edge  {x,  y)  which  is  not  traversed  after  cdB^,  steps]  <  . 

Then  since  d  <  n,  and  if  we  restrict  e  <  (1/2),  we  observe  that  choosing  c  >  6 
provides  an  upper  bound  of  1/n  on  this  probability.  If  we  restrict  ourselves 
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to  considering  graphs  on  n  >  12  vertices,  and  combine  this  probability  with 
that  in  equation(2.2),  we  may  now  claim  that 

Pr[we  have  not  traversed  all  edges  after  \2dB^  steps]  < 

6 


Using  the  independence  properties  of  random  walks  we  may  then  say 
Pr[Number  of  steps  to  complete  an  edge  cover  >  jl2dBe  steps]  < 

This  fact  provides  the  result  in  our  theorem.  ■ 


If  Winkler  and  Zuckerman’s  conjecture  that  B^  —  0{C^)  is  correct,  then 
we  have  the  new  general  bound  for  the  edge  cover  time,  0{dC^).  Unfortu¬ 
nately  this  bound  proves  to  overestimate  the  order  of  the  edge  cover  time  on 
the  lollipop.  Winkler  and  Zuckerman  have  proved  however,  that  the  “blan¬ 
ket”  time  is  0{C'^’)  in  all  the  “natural”  cases  such  as  cliques,  hypercubes 
and  expanders,  as  well  as  for  the  path  and  cycle.  Thus  we  may  use  0{dC'") 
in  those  cases,  which  turns  out  to  be  an  improvement  over  using  either  of 
Zuckerman’s  bounds  in  isolation. 
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CHAPTER  III 


THE  DIRECTED  EDGE  COVER 
TIME  OF  A  WALK  ON  THE  PATH 

3.1  Starting  from  an  Endpoint 

It  was  mentioned  in  Chapter  1  that  one  of  the  classical  examples  of  a  random 
walk  is  the  “Gambler’s  Ruin  Problem,”  a  probabilistic  game  played  on  the 
path.  We  also  mentioned  that  the  path  is  conjectured  to  be  the  graph  which 
maximizes  vertex  cover  time  for  walks  on  trees.  The  path  also  plays  a  central 
role  in  questions  concerning  directed  edge  cover’  times.  In  this  chapter  we 
determine  the  directed  edge  cover  time  for  a  walk  begun  at  an  endpoint  of 
a  path.  We  also  obtain  a  bound  on  the  directed  edge  cover  time  for  a  walk 
begun  at  any  point  of  the  path  and  show  that,  as  for  the  vertex  cover  time, 
this  quantity  is  maximized  for  a  walk  begun  at  the  center  of  the  path,  when 
it  is  long  enough.  (Somewhat  surprisingly,  however,  the  directed  edge  cover 
time  is  not  minimized  by  starting  at  an  endpoint.) 
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We  will  use  P„  to  denote  the  path  on  the  integers  [1 . . .  n].  That  is  the 
tree  on  the  vertex  set  [1 ...  n]  in  which  for  1  <  j  <  n  there  are  edges  {j  —  l,j} 
and  {j,j  +  1}.  We  will  first  investigate  the  edge  cover  time  for  a  random 
walk  on  Pn  which  starts  from  an  endpoint.  We  will  consider  the  endpoint  at 
which  the  walk  starts  to  be  labeled  vertex  1  and  the  other  endpoint  to  be 
vertex  n. 

It  is  obvious  that  the  undirected  edge  cover  time  in  this  case,  and  for 
all  trees,  is  the  same  as  the  vertex  cover  time.  Here  we  wish  to  compute 
the  expected  directed  edge  cover  time  for  this  walk.  After  a  little  thought  it 
becomes  apparent  that  prior  to  completing  a  directed  edge  cover  a  walk  will 
first  complete  a  vertex  cover  of  the  path,  and  that  the  vertex  cover  portion 
of  the  walk  will  end  upon  its  first  visit  to  vertex  n.  From  that  point  the  walk 
must  continue  until  it  traverses  the  directed  edge  closest  to  vertex  1  which 
was  not  traversed  during  the  vertex  cover  portion  of  the  walk.  The  expected 
vertex  cover  time  for  such  a  walk  is  the  same  as  the  amount  of  time  necessary 
for  a  walk  started  at  vertex  1  to  first  arrive  at  vertex  n,  i.e.  the  hitting  time 
from  1  to  n.  As  was  mentioned  in  Chapter  1,  EiHn  =  {n  —  1)^.  It  remains 
then  to  determine  the  additional  number  of  steps  needed  to  complete  the 
edge  cover.  As  we  contemplated  how  to  go  about  doing  this  many  questions 
arose.  How  many  edges,  on  average,  were  left  uncovered  during  the  vertex 
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cover  walk?  How  far  back  onto  the  path  was  the  first  missed  edge?  On 
average  how  far  back  should  we  expect  to  go?  We  answer  these  questions  by 
first  computing  the  probability,  for  each  edge  {k,k  —  1),  that  it  is  the  edge 
closest  to  vertex  1  not  traversed  during  the  vertex  cover  portion  of  the  walk. 
Then  for  each  edge  we  will  multiply  this  probability  by  the  hitting  time  from 
vertex  n  to  vertex  k  —  \.  Finally,  adding  these  quantities  yields  the  expected 
number  of  additional  steps  to  complete  the  edge  cover.  Once  we  have  this 
description  we  will  prove  that  the  expected  additional  number  of  steps  taken 
after  the  vertex  cover  is  completed  is  bounded  by  a  term  of  lower  order  than 
the  vertex  cover  time.  Specifically  we  will  establish  the  following: 

Theorem  3.1  The  expected  num,ber  of  steps  needed  to  ensure  a  simple  ran¬ 
dom  walk  started  at  an  endpoint  of  Pn  has  traversed  each  edge  in  the  path  in 
both  directions  is 

{l+o{l))n\ 

Indeed,  we  will  actually  show  that  after  completing  a  vertex  cover,  the 
expected  number  of  additional  steps  needed  to  complete  an  edge  cover  is 
0(n^/logn). 
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3.2  The  Probability  Distribution 


For  each  integer  k,  2  <  k  <  n,  we  wish  to  compute  P{X'^),  the  probability 
that  k  is  the  least  integer  such  that  edge  {k,k—  1)  was  not  traversed  during 
the  vertex  cover  portion  of  the  random  walk.  We  will  do  this  by  relating  the 
probability  distribution  for  P„  to  the  probability  distribution  of  Pn-i- 


Lemma  3.2  For  2  <  k  <  n  —  1, 

and 

P(X:)Pt{^^)p(Xr).  (3.2) 

The  proof  of  this  lemma,  and  others  to  follow,  will  make  frequent  use  of 
the  following  easily  established  result. 


Proposition  3.3  Given  a  random  walk  on  Pn  started  at  vertex  j,  the  prob¬ 
ability  that  the  walk  reaches  vertex  1  prior  to  vertex  n  is 


Proof 

Let  rj  be  the  probability  that  starting  at  vertex  j  the  walk  reaches  vertex  1 
prior  to  vertex  n.  Then  ri  =  l,r„  =  0  and 


r 


j  ~ 


rj+i- 
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The  solution  to  this  recurrence  relation  is  (n  —  j)/{n  —  1),  as  claimed. 


Proof  of  Lemma  3.2 


Consider  a  random  walk  started  at  vertex  1  on  Upon  first  visiting 
vertex  n  —  1  this  walk  completes  a  vertex  cover  walk  for  Pn-i-  Thus  the 
probability  distribution  of  untraversed  edges  at  this  point  is  2  < 

k  <  n  —  1.  With  probability  (n  —  k)/(n  —  k  +  1)  the  walk  will  reach  vertex 
n  prior  to  returning  to  vertex  k  —  1.  In  this  case,  if  k  is  the  least  integer 
such  that  edge  {k,k  —  1)  was  not  traversed  in  the  walk  prior  to  reaching 
vertex  n  —  1,  then  it  remains  so  when  vertex  n  is  reached.  All  of  these  walks 
then  contribute  to  P{X'^).  This  means  that  for  2  <  A:  <  n  —  1, 

This  leaves  only  those  walks  that,  upon  first  reaching  vertex  n  —  1,  step 
back  to  vertex  A;  —  1  prior  to  their  first  visit  to  vertex  n.  The  probability 
that  this  occurs  is  l/(n  —  A;  +  1).  Each  of  these  walks  has  the  property 
that  edge  {n,n—  1)  is  the  only  edge  not  traversed  when  the  walk  first  visits 
vertex  n.  This  gives  us 
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Clearly  P(X^)  =  0  for  n  <  k.  We  may  use  this  fact  to  reduce  P(XJ^)  to 
a  multiple  of  P{X^).  Iterating  (3.1)  yields 

^  ft ,  =  (^drn) 

Using  this  fact,  (3.2)  becomes 

Define  the  sequence  {fn}  by  fn-i  =  PiX^).  Then  by  equation(3.3) 

But  the  values  of  P{X^),  for  2  <  k  <  n,  represent  a  probability  distribution 
on  the  edges  of  P„,  so 

E  P(x:)  =  1. 

k=2 

This  leads  to  one  of  the  many  interesting  identities  satisfied  by  the  sequence 
{fn},  namely 

This  sequence  will  play  a  central  role  in  some  of  the  arguments  which  follow, 
and  we  will  rely  on  this  identity  throughout. 

Remark:  For  future  use  we  will  define  /o  =  0.  A  more  detailed  study  of 
the  properties  of  the  sequence  {/„}  is  contained  in  Appendix  A. 
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It  is  worth  noting  our  interpretation  of  the  elements  of  the  sequence  {/n}- 
When  the  walk  first  visits  vertex  k  the  probability  that  it  reaches  vertex  n 
prior  to  returning  to  vertex  /c  —  lisl/(n  —  fc+1).  Thus  looking  back  to 
equation(3.4)  we  see  that  fk-i  represents  the  conditional  probability  that, 
given  that  the  edge  {k,k  —  1)  was  not  traversed  prior  to  the  walk  reaching 
vertex  n,  it  is  the  closest  such  edge  to  vertex  1. 

Remark;  Clearly  upon  first  completing  a  vertex  cover  of  the  path  the 
walk  has  failed  to  traverse  at  least  one  edge,  namely  (n,  n  —  1).  Based  on  the 
observations  above  we  know  that  the  ei:pected  number  of  edges  which  were 
not  traversed  during  the  vertex  cover  portion  of  the  walk  is 

n  1 

S  n-*:  +  l 

Remark:  We  may  also  now  determine  on  average  how  far  back  onto  the 
path  we  must  go  to  complete  the  edge  cover. 

pn-k^l){^^)p(X;)=’'tu  (3.6) 

In  Appendix  A  we  prove  that  /„  ~  i^/hn),  and  by  combining  a  lower 
bound  on  /„  with  techniques  similar  to  those  used  in  Lemma  3.4  prove 
that  (3.6)  is  9{n/\ogn).  If  one  could  establish  this  bound  on  (3.6)  directly 
then  Lemma  3.4  would  be  an  immediate  consequence. 
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3.3  Computing  the  Expected  Directed  Edge 


Cover  Time 


Given  the  probability  distribution  discussed  in  Section  3.2  we  can  now  bound 
the  expected  directed  edge  cover  time  for  Using  the  hitting  time  result 
of  Section  1.2  we  find  that  the  hitting  time  from  vertex  n  to  vertex  A;  —  1  in 
Pn  is  {n  —  k  +  1)^.  If  we  let  EiC^  denote  the  expected  directed  edge  cover 
time  for  a  random  walk  started  at  vertex  1  on  P„,  then 

EiC‘  =  (n  -  1)"  +  f;  (n  -  *  +  lfP{X^). 

k=2 

Combining  this  with  equation(3.3)  yields 

k=2 

=  {n-lf  +  J2in-k)h.  (3.7) 

fc=i 

So  to  prove  that  the  expected  directed  edge  cover  time  is  (1  +  o(l))n^  we 
need  only  show  that  Z]  (u  —  k)fk  is  o{rr). 

k=l 

In  order  to  bound  this  expected  number  of  additional  steps  needed  after 
the  vertex  cover  is  complete  we  will  use  the  theory  of  generating  functions. 
Let  the  sequence  {gn}  be  defined  by 

71—1 

<7^-1  {n-  k)fk.  (3.8) 

fc=i 
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Lemma  3.4  Given  gn-i  as  in  equation (3. 8),  then  gn-i  =  O  • 

Proof 

The  ordinary  generating  function  for  the  sequence  {/„},  F{x),  may  be 
found  from  the  identity  in  equation (3. 5).  If 


or 

^  {1  -  x)\n{l  -  x)' 
Let 

Gix)  = 

n>l 

Then  using  definition(3.8)  we  have 


G{x)  =  F{x) 


(1— a;)^  (1  —  ln(l  —  a;)  (1  —  x)' 


:B{-x),  (3.9) 


where  B{x)  —  bkX^  is  the  ordinary  generating  function  for  the  Bernoulli 

fc>0 

numbers  of  the  second  kind,  {bn},  defined  and  studied  by  Jordan[17].^  The 
^D.  Fielder [15]  communicated  our  interest  in  this  generating  function  to  P.  Bruckman[8] 
who  pointed  out  its  relation  to  similar  functions  in  Howard[16],  which  in  turn  led  us  to 
the  work  of  Jordan. 
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elements  of  the  sequence  {bn}  are  coefficients  in  the  Bernoulli  Polynomials 
of  the  2^  kind  given  by 

Equation(3.9)  may  be  written  as 

=  (T^  (S  ’ 

which  makes  it  clear  that 

[x^]G(x)  =”e  (3.10) 

It  also  follows  immediately  from  (3.9)  that 

(i^)  F{x)  =  B(-x) 

which  shows  that 

u+i  -  u  =  i-irb,..  (3.11) 

Since  A’k„(a;)  =  'kn-i(ic),  where  A  is  the  difference  operator,  it  follows 
that  the  coefficient  bi  in  'k„(a;)  is  independent  of  n,  and  it  is  easily  seen  that 

I  (^]dt=^n+lil)-'^n+l{0)  =  bn.  (3.12) 

Combining  this  with  (3.11)  and  +  (*Zi)  leads  to 

o'  ' 
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It  also  follows  from  the  integral  representation  of  bn  in  (3.12)  that  (3.10)  can 
be  written  as 


0  j=0 


By  considering  the  coefficient  of  a;”  in  (1  —  x)*  we  see  that 


n— 1 

E(-i) 

j=0 


n-j-1  P  +  J  ~  1 

V  j  > 


(1— 

^  t  ' 


=  (-ir 


.n  —  1, 


So  equation(3.14)  becomes 


A  \  > 


(it. 


In  order  to  obtain  an  upper  bound  for  Qn-i  note  that 


0  ^  ' 


(It 


{t  —  {n  —  l)){t  —  n)dt 


(3.14) 


(3.15) 


^  {t^  —  {2n  —  l)f  +  n{n  —  l))(lt.  (3.16) 


Using  the  fact  that  (1  —  a:)  <  e~^,  we  can  bound  (3.16)  from  above  by 
1 

e 


=  *(3+4+  '+n-2)(^2  _  ^2n  —  l)t  +  n{n  —  l))dt. 


0 
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Then  if  we  let  s  =  hn-2  —  |  we  have 


an-i  <  Q  j  J  e  -  (2n  -  l)t  +  n{n  -  l))dt.  (3.17) 


Integration  by  parts  in  inequality(3.17)  leads  to 


Tl^  f  1  \ 

- 1)  (' " 


Hence 


On-l  < 


n 


21n(n-2)  -3^ 


+  o(l)  =  0 


n 

Inn 


Proof  of  Theorem  3.1 

The  proof  of  Theorem  3.1  is  immediate  when  one  combines  equations (3. 7) 
and  (3.8)  with  the  result  of  Lemma  3.4.  ■ 

Remark:  In  Appendix  A  we  show  that  gn-i  =  S2(n^/logn).  So  the 
expected  additional  number  of  steps  needed  to  complete  a  vertex  cover  walk 
to  an  edge  cover  walk  is  actually  d{n^/\ogn). 


3.4  Starting  from  an  Arbitrary  Point 

In  Sections  3. 1-3.3  we  described  how  one  could  compute  the  edge  cover  time  of 
a  simple  random  walk  on  Pn  started  at  vertex  1,  using  coefficients  we  chose  to 
label  fn-  We  have  that  after  completing  a  vertex  cover  the  expected  number 
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of  additional  steps  needed  to  ensure  an  edge  cover  is  0(n^/logn).  However, 
the  definition  of  edge  cover  time  is  the  maximum  of  the  edge  cover  times 
calculated  for  all  possible  starting  points.  As  was  mentioned  in  Section  1.2, 
the  vertex  cover  time  for  a  path  is  maximized  for  walks  started  from  the 
center.  This  was  easily  seen  by  maximizing  the  solution  to  the  recurrence 
formula  mentioned  there.  It  has  been  conjectured,  although  still  not  proven, 
that  the  vertex  cover  time  for  all  trees  is  maximized  by  a  random  walk  on  the 
path  started  at  the  center.  Unfortunately,  there  does  not  appear  be  such  a 
simple  description  of  the  edge  cover  time  for  a  walk  on  the  path.  Nonetheless, 
we  have  been  able  to  establish  a  tight  bound  on  the  edge  cover  time  for  walks 
started  at  an  endpoint.  Now  we  wish  to  generalize  that  result  to  the  case  in 
which  we  start  the  walk  from  an  arbitrary  vertex  1  <  j  <  (n  +  l)/2.  With 
this  information  we  will  answer  some  of  the  questions  posed  above  for  the 
edge  cover  case.  Note  that  due  to  the  symmetry  in  the  path  we  need  not 
concern  ourselves  with  walks  started  beyond  the  center/centers  of  the  graph. 

We  will  prove  that  for  n  >  9  the  expected  edge  cover  time  on  is  a 
strictly  increasing  function  of  the  starting  point,  j,  for  2  <  j  <  {n  +  l)/2. 
Combining  this  with  easily  computed  results  for  paths  on  nine  or  less  vertices 
we  will  then  be  able  to  deduce  that  the  expected  edge  cover  time  on  a  path 
is  maximized  in  walks  started  from  the  center/centers  when  n>  8  and  from 
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the  endpoint  for  n  <  8.  Moreover  we  will  be  able  to  prove  that  the  additional 
number  of  steps  needed  to  complete  an  edge  cover  for  a  walk  started  at  the 
center/centers  of  a  path  is  0{n‘^/\ogn).  An  interesting  fact  that  we  will 
establish  along  the  way  is  that  the  edge  cover  time  for  a  walk  started  at  an 
endpoint  on  Pn,n  >  3,  always  exceeds  that  of  a  walk  started  at  its  point  of 
attachment,  thus  this  time  is  not  then  minimized  by  starting  at  an  endpoint. 
It  turns  out  that  this  is  the  case  in  all  graphs,  for  all  leaves,  which  we  will 
prove  in  Chapter  4. 

Recall  that  when  considering  a  walk  started  at  vertex  1,  our  strategy 
was  first  to  make  use  of  the  expected  vertex  cover  time  which  was  nothing 
more  than  the  hitting  time  from  vertex  1  to  vertex  n.  We  then  computed 
for  each  k,  2  <  /c  <  n,  the  probability  that  k  was  the  least  integer  such  that 
edge  {k,k  —  l)  was  not  traversed  during  the  vertex  cover  portion  of  the  walk. 
Multiplying  the  probability  for  each  k  by  the  hitting  time  from  nto  k—1  and 
adding  the  results  together  then  gave  us  the  expected  number  of  additional 
steps  necessary  to  complete  an  edge  cover  walk. 

Now  we  need  the  vertex  cover  time  from  an  aibitrary  starting  point. 

Proposition  3.5  On  the  path,  Pn,  labeled  in  the  natural  way,  the  vertex 
cover  time  for  a  random  walk  started  at  vertex  j  is 

EjC"  =  -  2j  +  1)1  (3.18) 
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As  w£LS  mentioned  in  Section  1.2,  the  proof  of  this  result  is  similar  to 
the  recursive  proof  that  the  hitting  time  from  one  endpoint  to  the  other  is 
{n-lf. 

Unfortunately  no  such  simple  recursive  argument  appears  to  be  available 
for  the  computation  of  the  expected  edge  cover  time.  In  order  to  use  our 
previously  successful  method  we  must  condition  the  probability  distribution 
on  the  last  endpoint  visited  during  the  vertex  cover  portion  of  the  walk. 
Once  we  have  found  such  a  conditional  probability  distribution  we  can  again 
multiply  the  probability  for  each  edge  by  the  hitting  time  from  the  given 
endpoint  to  the  far  end  of  that  edge.  Summing  these  products  will  then  give 
us  the  expected  number  of  additional  steps  needed. 

We  will  use  the  following  notation. 

-  the  event  that  k  is  the  least  integer  such  that  a  random 
walk  on  started  at  j,  upon  completing  a  vertex  cover, 
has  not  traversed  edge  {k,k  —  1). 

-  the  event  that  k  is  the  greatest  integer  such  that  a  random 
walk  on  P„,  started  at  j,  upon  completing  a  vertex  cover, 
has  not  traversed  edge  {k,k  +  1). 


30 


-  the  event  that  a  random  walk  started  at  j  completes  a 
vertex  cover  at  n,  i.e.  visits  vertex  1  prior  to  n. 

A'j~^^  -  the  event  that  a  random  walk  started  at  j  completes  a 

vertex  cover  at  1,  i.e.  visits  vertex  n  prior  to  1. 

The  following  statements  are  immediate. 

PiYy,\A]-^-)  =  0  Yk 

PiXy\A]-y  =  0  VA; 

PiXy\A]-n  =  0  l<A;<j 

P{Yj:,\A]-y  =  0  j<k<n-l. 

Then  we  must  determine 

P{Xy  I  A]-^)  j  +  l<k<n 

P{YJ:,  I  Ar^)  l<k<j-l. 

Noting  the  symmetry  of  a  random  walk  on  we  have 

I  I  <r”n).  (3.19) 

Thus  it  will  suffice  to  obtain  a  characterization  of  I 
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Lemma  3.6  For  a  random  walk  on  Pn,  n>  3,  any  starting  vertex  j,  and  k 
satisfying  j  +  l<k<n  —  1,  we  have 


P{Xl,\A'r')  = 


/ n-  l\ 

V  n-2  J  \n  —  j  J 


'  n  —  k 
n  —  k  +  1 


p{x: 


n—l 

j,k 


and  for  A:  =  n 


I 


1— ►n 


)  = 


j-i 


+ 


(n-2)(n-j) 

(n-l 


n 


n—l 


n 


k  + 


1— +n— 1\ 


Proof 

We  are  now  concentrating  on  random  walks  on  P„  that  start  at  j  and  com¬ 
plete  a  vertex  cover  upon  first  stepping  to  vertex  n.  Notice  that  all  of  these 
walks  complete  a  vertex  cover  of  Pn-i  before  yielding  a  vertex  cover  of  P„.  We 
will  use  this  fact  to  relate  the  probability  distribution  of  P{Xf  f^  |  ^4]“"”“^), 
j  +  l<k<n  —  l,to  that  of  P(Wj^fc  I  j  +  I  <  k  <  n. 

Upon  first  arriving  at  n  the  walk  has  covered  all  vertices  in  P„  and  the 
last  new  vertex  visited  prior  to  n  must  have  been  either  1  or  n  —  1.  We  will 
compute  the  probability  distribution  of  P{Xff,  |  Jf  +  1  <  A:  <  n,  by 

considering  these  two  subcases  separately. 


Case(l):  Suppose  the  last  new  vertex  visited  prior  to  n  was  n—l.  Then  we 
need  to  compute  the  probability  that  this  walk  visited  1  prior  to  n  —  1,  given 
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that  it  visits  1  prior  to  n.  Using  elementary  probability  we  have 


P(A] 


yl  -)  = 


However,  ^(^4]“"”  |  ^)  =  1,  so  for  1  <  j  <  n  —  1, 

p(^l^n-l  I  ^  ) 

I  )  p(A]-^) 


(^) 

(s) 


n-  j  -  1\  f  n  -  1 


77.-2  J  \n  —  j 


7  .  (3.20) 


All  of  these  walks  that  visit  1  prior  to  n  —  l(and  prior  to  n)  begin  as 
vertex  cover  walks,  started  at  j,  for  P„_i.  So  P{XJ^^  \  is  well 

defined  in  these  cases.  Using  Proposition  3.3  we  know  that  upon  first  reaching 
vertex  n  —  1  the  fraction  {n  —  k) / {n— k  +  \)  oi  them  will  reach  vertex  n  before 
returning  to  vertex  A;  —  1 .  Thus  we  have  for  j  +  1  <  A;  <  ri  —  1 , 

P{Xl,  I  4—  n  P(xr,'  I  A}--').  (3.21) 

The  quantity  P{X'^^  \  A]”"”)  for  j  +  1  <  A;  <  ri  —  1,  can  now  be  obtained 


by  making  use  of 

p™  P(X"^n2l7“n2l7“-‘)  +  P(Xj»n2l7"nyir‘-') 

P(X^.k\A,  )- 

Note  that  PiX'^j^  fl  n  =  0,  for  j  +  1  <  A;  <  n  —  1,  from  which 


it  follows  that 


P{Xl,  I  A)-) 


n  A]-^  n  A]^"-i) 
P(A]-*") 
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n  A]--^  n  P{A]-^^  n  A]-^^-^) 

P{A]--^  n  1)  P{A)-^^) 

=  P{xii,  I  A]-^'^  n  I  A]-^^). 


Combining  this  with  (3.20)  and  (3.21)  yields  for  j  +  1  <  A;  <  n  —  1 


P{X: 


nx^k 


■^). 


Within  Case  (1)  this  leaves  only  those  walks  which  reach  vertex  A;  —  1 
before  vertex  n.  The  probability  that  this  occurs,  given  that  the  walk  visited 
1  prior  to  n  —  1,  is  l/(n  —  A:  +  1).  These  will  contribute  to  P(X"„  |  A]“*'”). 


Case  (2);  Suppose  the  last  new  vertex  visited  prior  to  n  was  1.  In  this  case 
the  walk  visited  n  —  1,  1,  and  n,  in  that  order,  so  we  have 


P(^” 


n— 1— >1 


P{Ay^-^  n  A]--^) 

P(Tp^) 

(S) 

j-i 


(3.22) 


{n-2){n-j)' 

Notice  that  the  only  edge  which  these  walks  have  not  traversed  upon  reaching 
n  is  {n,n  —  1).  Thus  all  of  these  walks  belong  to  the  class  of  walks  that 
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complete  a  vertex  cover  at  n  and  have  k  =  n  as  the  least  integer  such  that 
edge  {k,k  —  1)  has  not  been  traversed. 

Combining  this  fraction  with  the  walks  described  in  the  latter  part  of 
Case  (1)  results  in 


I  Aj-*")  =  I  + 


n— 1 


fc=j+i 

Using  equation(3.22)  we  have 


fn-l 


n 


p{xi„  I  A'j-n  = 
j  -  I 

(n-2){n-j)  ^ 
/n-j-l\  f n-  1\  ^ 
V  n-2 


1 


n  —  A;  +  1 


n— 1 


pixrk 


►n— 1 


)• 


3.5  Relation  between  P{^j[k  I  and  {/^} 

Lemma(3.6)  provides  us  with  recursive  formidas  for  the  probabilities  we  need 
in  terms  of  the  probability  distribution  on  a  path  containing  1  less  vertex. 
This  is  not  very  useful  in  practice.  Notice  that  if  we  set  j  =  1  we  have  exactly 
the  recurrence  that  we  found  in  our  analysis  of  walks  started  from  vertex  1. 
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This  suggests  that  we  may  be  able  to  reduce  our  recursive  expression  to  a 
more  tractable  form.  Rearranging  and  iterating  we  have  for  ji  + 1  <  A:  <  n~l, 


P{Xl,  \  A]--)  = 


3 

n 


k 


fn-l 


—  /u  +  1 


n-  j 


n  —  2 


P{Xl 


/■n—l  I  ^1— >71— !■( 


=  (^)  (;dm)  ^  I 


►n— 2\ 


n  —  1 

—  r 


'k-j 


n 


(3.23) 


P{X^i^  I  Aj~'^)  =  0  for  n  <  A;  so  this  is  as  far  as  we  may  reduce.  Also, 
since  the  P{X'^j^  \  ior  j  +  1  <  k  <  n,  represent  a  probability  distribu¬ 


tion  we  have 


PiXf,u  I 

=  1-  E  Pixi 

fc=j+i 
—  1 


Ap”) 


=  1 


n— 1 

^-3J  k^+i 


n 


iPy)  P(Xl  I  (3.24) 


These  results  have  reduced  the  amount  of  information  we  need  in  order 
to  use  the  stated  probabilities  to  determine  the  expected  edge  cover  time 
starting  from  j.  Our  next  step  is  to  find  a  description  of  |  AP^^)  in 

terms  of  quantities  we  already  know. 
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In  Section  3.3  we  provided  a  tight  bound  on  the  expected  number  of  ad¬ 
ditional  steps  needed  to  complete  an  edge  cover  having  already  accomplished 
a  vertex  cover  starting  from  an  endpoint.  Our  bound  relied  on  a  connection 
between  the  probability  that  a  particular  edge  was  the  first  missed  during 
the  vertex  cover  walk  and  our  coefficients  /„.  It  seems  that  there  should  be 
some  connection  between  the  edge  probabilities  we  have  calculated  for  the 
more  general  case  and  the  /„. 

R.emember  that 

Combining  this  with  equation(3.24)  leads  to  the  equation 


■  (s) 

n— 1 


or 


n-j 


I  =  2: 

k=j+l 


P(Xl  I  (3.25) 


\n 

When  j  =  1  this  is  the  recurrence  which  helped  define  the  coefficients  /„  in 
equation(3.5). 


Lemma  3.7  For  fixed  k, 


=  (/.-/.-of*"  ^ 

r=l  V 


k  —  r 


(3.26) 
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Proof 


We  will  prove  the  lemma  by  showing  that,  for  fixed  k,  the  two  sequences 
dependent  on  j  are  the  same  for  j  =  I  and  satisfy  the  same  defining  recur¬ 
rence  relation,  given  in  equation(3.25). 

Consider  the  case  when  j  =  1.  Then 


where  the  first  equality  is  due  to  the  fact  that  =  1,  and  the  second 

comes  from  the  definition  of  fk-i-  But,  remembering  that  /o  is  defined  to 
be  0, 

So  the  claim  is  true  when  j  =  1. 


Putting  s  =  k  —  j  we  have 


k—j 


k=j+i  Vn-fc  +  l/^  V  k 


k-  j  -r  -\- I 


n-3 


=  E 

5=1 

n~j 

=  E 


1 


\  \n-s-j+l 
n  —  J  —  r  +  1 


r=l 


n  —  r 


E(A  -  /.-■) 

.r=l 

E(/.-/.-i) 


Ls=l 


s  -  r  +  1 
-  r  + 

1 

r  —  s  +  1 


(3.27) 


For  r  >  1 


§  -  5  l)  2  ((r  +  1)  -  s) 


-  Efs- 
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and  relying  on  the  definition  of  {/«},  we  know  since  /o  =  0 


So  for  r  >  1  the  inner  sum  in  (3.27)  is  identically  0  leaving  only  the  term 
r  =  1  which  is  {n  —  j)/{n  —  1).  ■ 

3.6  Analysis  of  the  Edge  Cover  Time  for  an 
Arbitrary  Start  Vertex 

We  now  have  the  probability  that  a  particular  edge  is  the  last  edge  traversed 
in  an  edge  cover  walk  on  started  at  vertex  j  expressed  strictly  in  terms  of 
weighted  sums  of  coefficients  we  discovered  in  our  discussion  of  such  a  walk 
started  at  vertex  1.  Our  next  task  is  to  use  this  information  to  prove  that 
the  expected  edge  cover  time  on  a  path  started  at  vertex  j  is  an  increasing 
function  of  j  for  2  <  j  <  (n  +  l)/2.  Once  we  have  shown  this,  then  by 
the  symmetry  of  P„,  we  will  have  that  the  expected  edge  cover  time  for 
P„  is  maximized  when  a  walk  is  started  at  the  center,  or  when  n  is  even, 
centers,  of  the  path.  While  studying  this  question  we  combined  the  ability 
of  Mathematica  to  do  exact  rational  number  arithmetic,  and  the  probability 
distribution  defined  in  Lemma  3.7  based  on  weighted  sums  of  the  /n,  to 
confirm  computationally  that  the  expected  edge  cover  time  on  Pn,  started 
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Start  Vertex 


n 

1 

2 

3 

4 

5 

25 

50 

75 

2 

2 

3 

6.5 

6 

4 

13.42 

12.75 

5 

22.71 

21.96 

22 

6 

34.35 

33.55 

33.76 

7 

48.32 

47.49 

47.92 

48.16 

8 

63.75 

64.43 

64.97 

9 

82.32 

83.25 

84.14 

84.48 

10 

104.08 

103.19 

104.37 

105.63 

106.36 

49 

2668.67 

2667.69 

2677.66 

2692.03 

2708.61 

2939.39 

149 

24507.3 

24506.3 

24534.5 

24576.1 

24626 

25977 

27297 

27763 

Table  3.1:  Values  of  Edge  Cover  Time 

at  vertex  j,  is  an  increasing  function  of  j  for  2  <  j  <  (n  +  l)/2  when 
5  <  n  <  150.  A  sample  of  the  actual  predicted  edge  cover  times  is  contained 
in  Table  3.1. 

Let  EjC^^  be  the  expected  edge  cover  time  for  a  walk  on  started  at 
vertex  j.  Then  EjC^  may  be  computed  by  finding  the  expected  vertex  cover 
time  for  a  walk  started  at  j,  EjC",  and  then  computing  the  expected  number 
of  additional  steps  the  walk  will  need  to  return  from  the  last  point  of  the 
vertex  cover  to  the  edge  closest  to  the  starting  point  which  was  not  traversed 
during  the  vertex  cover  portion  of  the  walk.  In  the  notation  previously 
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introduced  this  is  then 


n 

k=j+l 

+  E  (if  p(y,^i/tr’)i’(/tr‘). 

fc=i 

Using  equation(3.19)  this  becomes 

EjC^  =  EjC^  +  f2  (n-k  +  l)^  |  A]-^^)P{A}-^^) 

k=j+\ 

fc=i 

Combining  this  with  equations(3.23)  and  (3.24),  and  making  use  of 


and 


leads  to 


+  E  (n-i  +  l)P(A'‘,|.4--‘)P(>l]-‘) 

k=j+\ 

fc=l 

Now  using  Lemma  3.7  we  have 
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+  (^-^  +  1)  (fr-l-  fr)  [-  7— -  ) 

fc=j+l  [  r=l  \  ^  _ 

r  /n-lc-r4-'[\ 

+  |:w[-Ea-.-w(H37n)]' 

In  Chapter  4  we  will  show  that  in  all  graphs  the  expected  edge  cover 
time  for  a  random  walk  started  from  a  leaf  exceeds  the  expected  edge  cover 
time  for  a  walk  started  from  its  point  of  attachment  by  exactly  1  minus  the 
probability  that  the  last  new  edge  visited  in  a  random  walk  started  from 
that  leaf  is  the  edge  from  its  point  of  attachment  to  itself.  Based  on  our 
interpretation  of  the  coefficients  fn  we  know  that  for  a  random  walk  on 
Pn  started  at  vertex  1  the  probability  we  complete  an  edge  cover  on  edge 
(2, 1)  is  exactly  l/(n  —  1).  Thus  a  random  walk  started  at  vertex  2  will  have 
expected  edge  cover  time  (n  —  2)/(n  —  1)  steps  less  than  a  walk  started  at  1. 
This  differs  from  the  vertex  cover  case  where  the  expected  vertex  cover  on 
Pn,  considered  as  a  function  of  the  starting  vertex  j,  is  a  strictly  increasing 
function  of  j  for  1  <  j  <  (n  +  l)/2.  The  exact  difference  in  the  expected  edge 
cover  times  starting  from  vertices  1  and  2  is  confirmed  by  a  direct  calculation 
of  equation(3.28)  with  j  =  1  and  j  —  2.  It  is  also  worth  noting  that  using 
the  analysis  to  follow,  the  expected  edge  cover  time  for  a  random  walk  on  Pn 
started  at  vertex  3  exceeds  that  for  a  walk  started  at  vertex  2  when  n  >  5, 
however,  it  does  not  exceed  that  for  a  walk  started  at  vertex  1  until  n  >  9. 
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Theorem  3.8  On  the  path  Pn,  EjC^  is  a  strictly  increasing  function  for 
2  <  i  <  ^  and  n  >  5 . 


Proof 

In  order  to  prove  the  theorem  it  suffices  to  show  that  Ej+iC^  —  EjC^  > 
0  for  2  <  j  <  (n  —  l)/2.  Also  since  we  have  established  that  the  theorem  is 
true  computationally  for  n  <  150  we  will  only  need  to  show  this  for  n>  150. 
We  will  do  so  by  studying  the  form  of  this  difference  in  light  of  our  knowledge 
of  the  sequence  /„.  We  will  derive  a  sequence  of  inequalities  each  of  whose 
proof  would  ensure  our  result. 

Using  equation(3.28),  and  after  a  good  deal  of  algebra,  one  obtains 
Ei^,C‘-E,C‘  =  („  -  2j)  - 

+  Z]  +  1)  E  (/.-I  -  fr)  (t  -) 

k=j+2  [r=l  XK-r/^ 

Reversing  the  order  of  summation  and  reindexing  the  first  of  the  two 
sums  above  yields 


n—j—l  n—j—k+l  , 

+  E  E  (/.-I-/.) 

fc=l  r=l  ^ 


n  —  k  — 


2 _ )' 

—  r  +  1/ 
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J— 1  J— fc+1 

+  X;  E  (/--■  -  fr) 

k=l  r=l 


n  —  fc  —  r  +  1 


It  is  now  easy  to  see  that  combining  the  like  terms  in  the  two  summations 
results  in 

jf-1  \n-i-k+l  .  1  , 

n—j-1  n~j-k+l  /  1  \ 

^  SIS  «(;^3*r7TT)_- 

Focusing  on  the  first  of  the  two  sums  above  and  using  the  fact  that  the 
fn  are  a  decreasing  sequence  for  n  >  1,  it  is  apparent  that 

j-l  ^n-j-k+l  ,  1  . 

S  ^  ,=§+2  -  A;  -  r  +  1 )_  ^ 

So  to  prove  that  Ej^iC^  —  EjC^  >  0  it  suffices  to  show  that 


n—j—l  n—j—k+1 

>T,k  E  (/-■  -  /'■) 


-1 

n  —  k  —  r  +  I 


(3.30) 


Using  identity(3.5)  we  obtain 


n—j—k+l  .  _i  .  n—k 


ri  —  A:  —  r+l/’ 


Applying  this  to  the  right  side  of  inequality(3.30)  yields 

n—j—l  n—k  /  1  \ 
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fn—k—i+l) 


(3.31) 


n-j-l  j-1  -1, 

=  EWEl- 

k=j  i=i 

It  should  be  noted  that  in  this  form  each  of  the  inner  sums  have  exactly 
j  —  1  terms  and  that  there  are  a  total  of  n  —  2j  terms  in  the  outer  sum. 
Reversing  the  order  of  summation  leads  to  a  good  deal  of  cancellation  and 
so  (3.31)  becomes: 


i-i  /_ix 

J  XI  ( ”7" )  (^)  S  ifk-i+l) 

+  (n-jf  -l)Xl 

i=l  ^  ^ 

Using  identity (3. 5)  the  third  term  in  this  expression  for  (3.31)  can  be  written 
as 

(7)  fj-i+i  =  (^  -  J  -  1)  (^1  -  • 

Subtracting  this  term  from  both  sides  of  (3.30)  and  multiplying  the  resulting 
inequality  by  —  1  yields 


1  n 

J  +  - - 

3  n-j 


j-i 


i=l 


j-i 


■  <  jI:  -  A-i-i+i  +  E  7  E  ih-i+i)-  (3.32) 


t=l 


n-j-l 


k=j+l 


Let  E  denote  the  first  term  on  the  right  side  of  inequality(3.32)  and  F 
the  second  term.  Then  in  order  to  prove  that  Ej^iC^  —  EjC^  >  0  it  suffices 
to  prove 


F>j  + 


1 


n 


n-'j 


E. 


(3.33) 
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Once  again  using  identity(3.5)  we  have 


E  =  j 

Note  that  /i  =  1  so 


n-2j+l  /  Y  ' 

^  S  [n-j-k  +  l^ 


fk 


n-2j+l 

y: 


n  —  j  —  k  +  \ ^ 


fk, 


and  (3.33)  may  be  rewritten  as 


1 


n-2j+l 


F>--i+i  Y 
J  k=2 


fk. 


(3.34) 


n-j-k+1 

In  order  to  prove  that  this  inequality  holds  we  now  consider  the  following 


two  cases: 


Case  (1):  f  <  j  <  ^.  Case  (2):  2  <  j  <  f . 


Case  1 

Studying  the  right  side  of  inequality (3.34)  and  remembering  that  /r  < 
Vr  >  2  it  is  easy  to  see  that 


1 


n— 27  +  1 


fk<  fk<^-j. 

k=2  ^ 


n  —  j  —  k  +  1 


Thus  for  Case  1  it  suffices  to  show  that 


F> 


n  n  —  I 

when  —  <  7  < - 

4  -  J  -  2 


(3.35) 
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j-1 

Changing  the  order  of  summation  in  F  —  (^)  ifk-i+i)  yields 

t=l  k=i+l 


n-j-l  j-l  .1. 

F=  E  E  (-) (A-<+.) • 

fc=j+i  t=i  ‘  ^ 

Remembering  that  hn  denotes  the  harmonic  number,  we  note  that 

>  hj-ifj+i  whenk  =  j+l, 

(t)  (A_i+i)  >  /ij-i/i+2  when  A;  =  j  +  2, 

i=l 


~  ^  /In 

(j-j  ifk-i+i)  >  hj-ifn-j-i  when  k  =  n-  j  -1. 


Summing  these  inequalities  yields 


F  >  hj_i{n  -  2j  - 


In  Appendix  A  we  show  that  for  r  >  3,  /,.  >  (/t)-  ^ 

f  <  i  <  it  is  certainly  true  that  fn-j-i  >  (|)  ,  which  implies 


3\  f 


(3.36) 


A  simple  application  of  the  Cauchy-Schwartz  inequality  to  hj_i  and  hn-j-i 


shows  that 


hj^i  ^  ^  hji—j—i  ^'j—\ 
hji—j—i  1 
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But  hn-j-i  —  hj-i  <  ln4  when  ^  <  j  <  and  n  >  150,  leading  to  the 
conclusion  that 


If  we  let  j  =  ^  —  a,  where  4  <  a  <  f;  inequality(3.37)  becomes 


(3.37) 


(3.38) 


and  we  have 

1  .  /n  \  1  . 

7-'+(2-d=r‘+“' 

Thus  to  prove  inequality(3.35)  it  suffices  to  show  that 


|i)(2„-l)>l-l  +  „,  =  (3.39) 


For  n  >  150  and  j  >  j,  <  |.  So 


3A  /  *’^4  \  1 


and  for  n  >  150 


1  ,  4 

Q;-->a-l  +  -. 

2  n 


This  then  implies  inequality(3.39)  which  was  sufficient  to  prove  that 


77  71  —  1 

>0  for  -<j<  — . 


Case  2 


48 


Now  we  wish  to  show  the  result  for  2  <  j  Simple  manipulation  of  our 

general  description  of  expected  edge  cover  time  in  equation(3.28)  with  j  =  2 
and  j  =  3  shows  that  for  n  >  5,  >  E2C''.  So  we  really  only  need  to 

prove  Ej+iC^  —  EjC^  >  0  for  j  >  3.  Once  again  referring  to  inequality (3.34), 
note  that  applying  identity (3. 5)  (and  reindexing)  we  have 


"if'  ( — )/i-i: 


n-j 


Thus  to  establish  inequality(3.34)  it  suffices  to  show  that 


or  even 

Let  j  =  a{n—  1)  with  <  a  <  |  (these  bounds  are  appropriate  since 
here  in  case  (2)  j  <j).  Then  referring  to  inequality (3.40)  and  remembering 
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that  /i„  >  Inn,  Vn  >  1,  we  want 


f\n[a{n  —  1)  —  1]' 
4> 


^(1— Q)(n— 1) 

Canceling  n  —  1  from  both  sides  we  need 

f\n[a{n  —  1)  —  1] 


(1  —  2Q:){n  —  1)  >  a{n  —  1). 


(1  -  2a)  >  a. 


But  hk  <  ln(A;)  +  1  for  /c  >  1  so  it  suffices  to  show  that 


^  ln[Q;(n  —  1)  —  1] 

AJ  \  In  [(1  —  a){n  -  1)]  +  1 


(1  -  2a)  >  a. 


or  equivalently 


(1  —  2q:)  ln[Q:(n  —  1)  —  1]  —  Q;(ln[{l  —  a){n  —  1)]  +  1)  >  0  (3.41) 

When  n  >  150  and  a  =  or  a  =  |  the  inequality  is  valid  proving  that 
the  expression  on  the  left-hand-side  of  (3.41)  is  positive  at  the  endpoints 
of  the  interval  in  question.  Using  basic  calculus  one  finds  that  the  second 
derivative  of  this  expression,  taken  with  respect  to  a,  is  negative  throughout 
the  interval,  and  thus  the  expression  must  be  positive  throughout  the  interval. 
By  our  chain  of  sufficient  inequalities  we  then  have 


-  EjC^  >0  for  2<j<j. 


Cases  (1)  and  (2)  in  combination  show  that  for  n  >  150,  EjC^  is  a  strictly 
increasing  function  of  j  for  2  <  j  <  ■ 
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Combining  the  fact  that  >  EiC*^  forn  >  9  with  our  computation  of 
the  expected  edge  cover  time  for  small  n  we  have  as  an  immediate  corollary 
that  it  is  maximized  by  starting  at  the  endpoint  for  walks  on  when  n  <  8, 
and  maximized  by  starting  at  the  center/centers  for  all  longer  paths. 

3.7  The  Additional  Number  of  Steps  Needed 
When  Starting  From  the  Center 

Equation(3.28)  provides  a  formula  to  compute  the  expected  edge  cover  time 
starting  from  any  point  on  the  path  using  the  vertex  cover  time  and  weighted 
sums  of  our  coefficients  /„.  In  Section  3.3  we  proved,  for  a  random  walk 
started  at  an  endpoint  that  the  additional  number  of  steps  needed  to  assure 
an  edge  cover,  after  the  vertex  cover  walk  was  completed,  was  0(n^/logn). 
We  can  prove  that  this  quantity  provides  an  upper  bound  for  walks  started 
from  the  center /centers. 

Theorem  3.9  For  a  random  walk  started  at  the  center/centers  of  Pn,  the 
expected  additional  number  of  steps  which  are  needed  to  complete  an  edge 
cover  after  all  vertices  have  been  visited  at  least  once  is  0(n^/ log  n). 

Proof 
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We  will  first  prove  that  the  theorem  is  true  for  n  odd.  We  will  then  use 
that  fact  to  make  the  argument  for  even  n.  The  quantity  in  which  we  are 
interested  is  EjC^  —  EjC"  when  j  =  (n  +  l)/2.  This  consists  of  the  latter  two 
sums  in  equation(3.28). 


EjC^-EjC^  =  (n-k  +  l) 

k=j+l 


-E  (fr-l-fr) 

r=l 


'k  —  j  —  r  +  l' 


k  —  r 


3-1 

+  E  ^ 

fc=i 


3-k 


E  (/->  -  fr) 


r=l 


j  —  k  —  r  +  1 
n  —  k  —  r  +  1 


(3.42) 


Reindexing  the  first  summation  we  have 


E  (n-k+l) 

k=j+l 


r=l  \ 


n-j 

E*^ 

k=l 


{n-j+l)-fc 


-  E  (/-.  -  M 


k  —  r 
n  —  j  —  r  —  k-\- 2' 


r=i 


n  —  r  —  k  +  l 

We  are  interested  in  the  quantity  when  j  =  {n+  l)/2,  which  implies 
n  —  j  =  j  ~  1  =  {n  —  1) /2.  So  in  this  case 


.  (3.43) 


n—  1 
2 


EjC^  -  EjC^  =  2Y^  k 

k=i 


/!i±I  _  _  r-  4-  1  ’ 

r=l 


n  —  k  —  r  +  1 


.  (3.44) 


It  is  easier  to  analyze  equation(3.44)  if  we  break  up  the  summation  yielding 

1 


(n-1) 

2 


'  ^+1 
2 


1  \  2  /  2  /  n±L  _  U  _  J.  ^ 

-]EjC^-EjC'’  =  E  i-k)  E  V 

2)  ’  ’  h  \  h  \  n-k-r+\ , 


(”-1) 


+  E 

k=\ 


E  fr 

r=l 


n  —  k  —  r  -\- I 
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This  in  turn  may  be  broken  up  further  into 


l)E,c--E,cr  = 


n^-l 

2 

"+i  h 

2 

-  E 

E 

fe=i 

r=2 

n  —  1 

2 

/Tl+l 

+  E 

(  ' 

r=l 

\  n  —  r 

n—  1 

0 

^-k 

fc-r  +  l 
n  —  k  —  r  +  1 


-5-  2  /  !i±i  _  jU  _  ^  I  1 

*  S‘S 

Now  we  use  the  fact  that  /o  =  0  and  reindex  again  to  obtain 

/n  „  _  „ 


-)  EjC^  -  = 

n-l  1  n-1  ,  , 

-  E  E  /J  ^ . . 

rtl  \  n-k-r  J 
^  /n±l-r\ 

+  EA  E  (^+1)  E  fr 

r=i  \  n  r  J 


±i  _  r-\  ^  2'  ^  / ”±1  _  _  7- 

_ Ll  I  .  1^  N^  f  [2 _ Z _ L 


n  —  k  —  r 


This  then  reduces  to 


n  — 1  1  n  — 1 


/n+l  ^ 

EjCr-EjCr  =  2  ^  Y.  fA^—r — 

M  rt;  \n-k-T 


n  —  k  —  r 


n— 1  1  n— 1 


1  ^-k 


=  E  E 

fc=o  7=1 


n  +  1  —  2A;  —  2r 
n  —  k  —  r 


n— 1  1  n  — 1  1^ 

2  2 

=  2  E  E  A 

fc=:0  r— 1 

Tl— 1  I  TT  — 1  u 
2  ^  2  ^ 

-  (^-1)  E  E  Z'- 

fc=0  r=l 


n  —  k  —  r 


(3.45) 
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n— 1 
2 


-  (n-l)j2fr  (hn-r  “ 


r=l 


(3.46) 


Referring  to  Appendix  A  we  know  that 


n  — 1 
2 


2E 


r=l 


So 


EiC^  -  EiC^  < 


O 


n" 


Now  let  n  be  even.  Consider  a  random  walk  started  at  vertex  n/2  —  1  on 
Pn  and  another  walk  started  at  n/2  —  1  on  Pn+i-  It  becomes  clear  that  the 
expected  edge  cover  time  of  the  latter  must  be  at  least  as  great  as  the  former 
when  one  considers  the  following  modification.  Modify  P„  by  adding  a  loop 
to  vertex  n.  Now  upon  reaching  vertex  n  let  the  walk  have  probability  1/2  of 
stepping  back  to  vertex  n—  1  and  probability  1/2  of  stepping  around  the  loop. 
Compare  a  walk  started  at  vertex  n/2  —  1  on  this  modified  path  to  that  of  the 
walk  on  P„.  Each  time  we  visit  vertex  n  on  we  return  to  vertex  n  —  l(and 
therefore  back  towards  untraversed  edges)  with  probability  1  taking  1  step 
to  do  so.  On  the  modified  path  we  step  back  to  vertex  n  —  1  only  1/2  of 
the  time  and  the  other  1/2  of  the  time  waste  a  step  on  the  loop.  It  is  clear 
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that  in  the  modified  case  the  expected  number  of  steps  to  traverse  all  the 
directed  edges  from  1  to  n  must  be  at  least  as  great  as  the  expectation  on 
P„.  Thus  it  is  clear  that  when  one  compares  the  edge  cover  walks  on  and 
Pn+ij  as  described  above,  that  the  expected  edge  cover  time  for  the  walk  on 
P„  must  bound  from  below  that  of  the  walk  on  Pn+i-  But  from  our  previous 
results  the  edge  cover  time  for  a  walk  on  P„  is  maximized  by  a  walk  started 
at  vertex  n/2  —  1.  Thus  we  have 


If  we  denote  by  An  the  expected  additional  number  of  steps  to  complete  the 
edge  cover  after  visiting  all  vertices  on  P„  then  using  (3.28) 


An  < 
An  < 


5 
4 

-jn  +  -  +  An+i. 


(n  -  1)^  +  -  +  An+i 


Since  we  have  just  proved  that  An+i  =  0(n^/logn)  we  now  have  An  = 
0{n^ /\ogn)  as  desired.  ■ 
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CHAPTER  IV 


GENERAL  RESULTS  FOR  EDGE 
COVER  TIME  ON  TREES 

4.1  Minimum  Edge  Cover  Time  for  a  Tree 

In  Chapter  1  we  mentioned  a  number  of  r  esults  that  have  been  proven  for  the 
vertex  cover  time  for  trees.  Having  extensively  studied  one  tree — the  path, 
we  now  focus  on  other  trees  and  show  parallel  results  for  edge  cover  time. 
The  main  result  of  this  chapter  is  the  theorem  below. 

Theorem  4.1  For  any  n>S,  the  star  on  n  vertices,  Sn,  with  starting  vertex 
at  the  center,  has  the  minimum  edge  cover  time  among  all  trees  on  n  vertices. 

In  order  to  prove  the  theorem  we  will  fir  st  show  that  the  minimum  edge 
cover  time  for  any  graph  is  attained  by  a  walk  started  from  some  non-leaf 
vertex.  We  will  then  prove  that  the  edge  cover  time  of  a  tree  must  be  at 
least  2  more  steps  than  the  minimum  vertex  cover  time  for  that  tree  and 
finally  argue  that  the  star  is  the  only  graph  which  achieves  this  minimum. 
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Having  proved  this  theorem  we  will  also  determine  the  probability  that  a 
random  walk  on  Sn  completes  a  vertex  cover  in  exactly  k  steps  which  leads  to 
two  interesting  identities  involving  the  Stirling  Numbers  of  the  Second  Kind. 
We  will  note  that  the  edge  cover  time  for  a  walk  on  any  tree  is  bounded 
above  by  the  sum  of  the  vertex  cover  time  and  the  maximum  hitting  time 
in  that  tree.  This  will  lead  to  a  bound  on  the  edge  cover  time  of  a  balanced 
k-ary  tree  of  0{n\og'^n  +  n\ogn).  Finally  considering  all  we  have  established 
regarding  the  edge  cover  time  of  random  walks  on  tr  ees  we  conjecture  that 
it  is  maximized  by  a  walk  started  at  the  center /centers  of  a  path. 

While  studying  edge  cover  walks  on  the  path  we  determined  that  the  edge 
cover  time  for  a  random  walk  on  P„,  started  at  an  endpoint,  exceeded  that 
for  a  walk  started  at  its  point  of  attachment  by  exactly  (n  —  2)/(n  —  1)  steps. 
It  turns  out  this  is  true  for  the  star  as  well.  That  is,  the  edge  cover  time  for 
a  walk  started  from  a  leaf  on  Sn  exceeds  that  of  a  walk  started  at  the  center 
by  (n  —  2)/(n  —  1)  steps.  This  is  the  reverse  of  the  situation  for  the  vertex 
cover  in  which  the  vertex  cover  time  is  less  for  walks  started  at  a  leaf  than 
for  walks  started  at  their  points  of  attachment.  These  lesults  led  us  to  ask 
the  question:  For  any  gxaph,  G,  can  we  characterize  the  edge  cover  time  for 
a  walk  started  from  a  leaf  of  G  in  terms  of  the  edge  cover  time  for  a  walk 
started  at  its  point  of  attachment.  The  answer  is  yes. 
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Lemma  4.2  Given  a  graph,  G,  with  at  least  3  vertices,  a  vertex,  x,  of  G 
with  degree  1,  and  vertex,  y,  the  neighbor  of  x  in  G,  then 

E,C^  = 

EyC^  +  1  —  Y*i[walk  started  at  x  completes  an  edge  cover  on  edge  (y,x)]. 

Proof 

Consider  the  set  of  all  walks  started  at  y  and  observed  until  they  first 
achieve  an  edge  cover  of  G.  This  set  may  be  partitioned  into  two  classes  based 
on  the  last  edge  the  walk  traverses. 

Ay  —  walks  started  at  y  which  complete  an  edge  cover  of  G  on 
edge  {x,y). 

By  =  walks  started  at  y  which  complete  an  edge  cover  of  G  on 
an  edge  other  than  {x,y). 

Likewise  we  may  partition  the  set  of  all  walks  started  at  x  and  observed  until 
they  first  achieve  an  edge  cover  of  G  into  two  classes. 
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Ax  =  walks  started  at  x  which  complete  an  edge  cover  of  G  on 
edge  {y,x). 

Bx  =  walks  started  at  x  which  complete  an  edge  cover  of  G  on 
an  edge  other  than  {y,x). 

Given  an  element  of  set  Ay  one  may  create  a  unique  element  of  set  Ax  by 
adding  a  step  from  x  to  y  to  the  beginning  of  the  walk  and  deleting  the  step 
from  X  to  y  from  the  end  of  it.  Similarly  one  may  take  an  element  of  Ax 
and  create  a  unique  element  of  Ay  by  the  reverse  process.  Thus  there  is  a 
bijection  between  sets  Ay  and  Ax  so  they  must  have  the  same  cardinality,  and 
moreover,  the  operation  described  does  not  change  the  length  of  the  walks 
so  the  expected  length  of  the  walks  in  each  set  must  be  the  same. 

Now  consider  the  walks  in  By  and  Bx-  Each  walk  in  By  may  be  converted 
to  a  walk  in  Bx  by  the  addition  at  the  beginning  of  the  walk  of  a  step 
from  X  to  y.  One  may  reverse  this  process  to  convert  any  element  of  Bx  into 
an  element  of  By.  Thus  these  sets  also  have  the  same  cardinality  and  the 
expected  length  of  the  walks  in  By  must  be  1  step  less  than  the  expected 
length  of  the  walks  in  Bx- 
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If  we  indicate  the  expected  length  of  a  walk  in  X  by  Lx,  then  putting 
these  observations  together  yield 

EyC^  =  P{Ay)LA,  +  P{By)Ls, 

-  P{A,)La^  +  P{B,)Lb^ 

But  P(Aa:)  =  P{Ay)  >  0,  P(Px)  =  P{By)  >  0,  La,  =  La^  and  Lb,  =  Lb„  + 1 
so 

E,C^  =  EyC--  +  P{By)  =  EyC^  P  1  -  P(^).  ■ 

Thus  from  Lemma  4.2  we  now  know  that  the  minimum  expected  time  to 
cover  all  the  edges  of  a  graph  occurs  from  some  non-leaf  vertex.  In  particular, 
since  Sn  has  only  one  non-leaf  vertex,  it  is  easy  to  compute  the  minimum 
edge  cover  time  for  it.  An  edge  cover  of  the  star  for  a  walk  started  at  the 
center  consists  of  one  step  to  some  leaf  followed  by  a  vertex  cover,  and  then 
one  additional  step  to  traverse  the  edge  from  the  vertex  at  which  the  vertex 
cover  ended  back  to  the  center.  Thus  the  minimum  edge  cover  time  on  Sn  is 
exactly  EzC'’  +  2  where  2:  is  any  leaf  of  S'„. 

Proof  of  Theorem  4.1 

Let  G  be  any  tree  on  n  >  4  vertices,  and  suppose  that  y  is  a  non-leaf 
vertex  of  G  which  minimizes  the  edge  cover  time  on  G.  Brightwell  and 
Winkler[7]  point  out  that  the  vertex  cover  time  from  y  exceeds  the  vertex 
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cover  time  starting  from  some  leaf  in  G  by  at  least  one  step,  with  equality 
only  when  G  is  S^-  This  is  true  because  a  walk  which  completes  a  vertex 
cover  of  G  creates  a  spanning  tree  of  G  in  the  process.  If  such  a  walk  is 
started  at  a  non-leaf  vertex  it  must  walk  to  some  leaf  and  then  trace  out  the 
tree.  The  star  is  the  only  tree  in  which,  starting  from  a  non-leaf  vertex,  the 
expected  time  to  reach  any  leaf  is  exactly  1  step.  Thus  if  G  is  any  tree  other 
than  Sn,  EyC"  >  E^C'"  -f  1  for  some  leaf  x.  An  edge  cover  in  G  started  at  y 
will  consist  of  a  vertex  cover  and  then  at  least  1  more  step.  Thus 

EyC^  >  EyCr  +  1  >  +  2, 

with  equality  on  the  right  only  when  G  is  5'„. 

Brightwell  and  Winkler  proved  that  the  minimum  vertex  cover  time  for 
any  tree  on  n  vertices  occurs  for  a  walk  on  S'„  started  from  a  leaf.  Thus,  if  G 
is  not  Sn  we  have 

EyC^G  >  ErCl  +  2  >  E,C^^  +  2, 

where  2;  is  any  leaf  of  Sn- 

This  leaves  only  the  cases  where  n  <  4.  The  only  tree  on  3  vertices  is  the 
star.  For  n  =  4  there  are  two  trees:  the  path  and  the  star.  Using  the  results 
of  Chapter  3  the  minimum  edge  cover  time  for  F4  is  found  to  be  12  3/4  steps 
for  a  walk  started  at  one  of  the  centers,  while  the  minimum  on  5*4  is  11  steps 
for  walks  started  at  the  center.  Thus  the  theorem  is  true  Vn  >  3.  ■ 
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Given  the  result  in  Lemma  4.2  it  is  now  easy  to  determine  the  maximum 
edge  cover  time  for 

Corollary  4.3  The  maximum  edge  cover  time  on  Sn  occurs  for  a  random 
walk  started  at  a  leaf  and  has  value 

2(n  —  l)hn-2  +  2  +  _  1 )  ■ 

Prom  Lemma  4.2  we  know  that  this  maximum  occurs  from  a  leaf,  say  it 
is  labeled  x,  and  that  the  edge  cover  time  from  a  leaf  exceeds  that  from  the 
center,  say  it  is  labeled  y,  by 

1  —  Pv[walk  started  at  x  completes  an  edge  cover  on  edge  (y,x)]. 

Proposition  4.4  The  probability  that  a  random  walk  on  Sn,  started  at  a 
leaf,  X,  completes  a  vertex  cover  without  returning  to  x  is  • 

Proof  of  Proposition  4.4 

Consider  the  sequence  of  leaves  visited  by  such  a  walk.  Having  started 
at  x  the  walk  visits  the  center  point  and  is  then  equally  likely  to  step  to 
any  of  the  n  —  1  leaves.  Now  consider  all  random  walks  started  from  the 
center.  It  is  easy  to  see  that  the  probability  any  such  walk  ends  at  vertex  x 
is  l/(n  —  1).  Thus  of  all  walks  started  at  x,  l/(n  —  1)  will  fail  to  revisit  x 
prior  to  completing  a  vertex  cover.  ■ 
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This  result  proves  that  the  probability  that  a  random  walk  on  Sn  started 
at  a  leaf,  x,  completes  an  edge  cover  on  the  edge  from  the  center  to  a:  is 
1/in  —  1).  Thus  we  have  the  result  in  Corollary  4.3.  ■ 


4.2  Stirling  Number  Identities  From  Stars 

There  is  a  second  argument  proving  Proposition  4.4  which,  while  more  tech¬ 
nical  in  nature,  provides  two  interesting  identities  involving  the  Stirling  Num¬ 
bers  of  the  Second  Kind.  First  we  will  need  to  determine  the  probability  that 
a  random  walk  started  at  a  leaf  of  Sn  completes  a  vertex  cover  exactly  on 
step  2k,  k  an  integer. 


Lemma  4.5  Given  a  random  walk  on  Sn  started  at  some  lea.f,  x,  the  proha- 
bility  that  this  random  walk  completes  a  vertex  cover  on  step  m  =  2k  is  given 
by 


P{E,C^  -  2k)  = 


(n  —  2)\Sik,  n  —  2) 


[n  —  1)*  ’ 

where  <S'(a,6)  represents  a  Stirling  number  of  the  Second  Kind. 


(4.2) 


Proof 

Let  m  =  2k.  Note  that  k  must  be  at  least  n  —  2.  For  this  argument 
consider  the  center  vertex  to  be  labeled  y.  The  vertices  visited  by  the  random 
walk  form  a  sequence  in  which  we  may  consider  position  0  to  be  filled  by  our 


63 


start  vertex  and  every  odd  element  will  be  filled  by  the  character  y.  Suppose 
the  starting  point  is  chosen  with  uniform  probability  from  the  leaves  of  Sn- 
Then  the  walk  will  fill  A;  +  1  slots  in  the  sequence  from  an  “alphabet”  of 
n—\  characters.  If  the  walk  completes  a  vertex  cover  on  step  m  it  must  have 
visited  all  but  one  of  the  n  —  1  leaves  by  step  m  —  2  =  2{k  —  1).  In  order 
to  count  the  number  of  sequences  that  fulfill  this  requirement  we  may  first 
identify  one  of  the  leaves  to  be  missed  prior  to  step  m.  There  are  n  — I  ways 
to  do  this.  Then  the  number  of  ways  to  fill  k  slots  in  the  sequence  from  an 
alphabet  of  size  n  —  2,  assuming  all  characters  must  appear  at  least  once,  is 
(n  —  2)\S(k,n  —  2).  The  total  number  of  ways  to  fill  k  slots  from  an  alphabet 
of  n  —  1  characters  is  (n  —  1)^.  The  probability  that  on  step  m  we  reach  the 
remaining  unvisited  vertex  is  l/(n  —  1).  Combining  these  results  we  have 
that  the  probability  that  a  random  walk  completes  a  vertex  cover  on  step 
m  =  2A:  is 

/  1  \{n—l){n  —  2)\S{k,n  —  2)_{n  —  2)\S{k,n  —  2) 

\n  — 1/  (n  —  I)*'  (n  —  1)*^ 

But,  since  the  starting  vertex  was  chosen  with  uniform  probability  this  is 
also  the  fraction  of  all  walks  starting  at  vertex  x  which  complete  a  vertex 
cover  on  step  m.  m 
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Since  the  random  walk  completes  a  vertex  cover  with  probability  1  we 


have 

f;  p\E,c’  =  2k]=  t 

k=n-2  k=n-2  V"- 

which  leads  immediately  to  the  first  of  our  two  identities.^ 


=  1. 


Corollary  4.6 


S{k,n)  _  1 

ho  (^^+1)'^  nV 


This  may  be  confirmed  directly  as  follows: 


^  P[E,C’  =  2k]  = 

k=n—2 

=  t 

k=n-2  “  1) 

1  fn-  2 


=  E 


k~n 


I2  (n  -  1)  j=o 


-kE  r  ru-^y in- j- 2)^ 


j 


j=0 


J 


k=n—2 


n  —  1 


j=0 


n—2 


n  —  \ 


71—2 


r=l 


n  —  1 


^Wilf  points  out  that  these  identities  may  also  be  proven  by  appropriate  substitutions 
for  X  in  the  generating  function  Ylik>o  S{k,n)x^  =  x”/(l  —  a;)(l  —  2x)  •  •  •  (1  —  nx). 
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n— 1 

E 


'n  —  1 

I  r  j 


(-ir 


(n  —  1)  —  r 


n— 2 


r=0 

f  (n—  l)\S{n  —  2,n—l) 


n  —  1  J 

+  1  =  1. 


+ 


n  —  1 
n  —  I 


n—2 


\  (n  -  l)™-2 
The  last  equation  follows  from  the  fact  that  S{n  —  2,n  —  1)  —  0. 


We  may  use  similar  reasoning  to  calculate  what  fraction  of  random  walks 
complete  a  vertex  cover  on  step  m  and  have  not  returned  to  the  start  vertex. 
We  may  choose  the  vertex  which  will  be  visited  on  step  m  in  n—2  ways.  Since 
slot  0  is  assumed  to  be  filled  by  the  character  x,  and  none  of  the  remaining 
slots  may  be  filled  by  x,  or  the  end  vertex,  we  must  fill  fc  —  1  slots  from  an 
“alphabet”  of  n  —  3  characters  using  each  character  at  least  once.  This  can 
be  done  in  (n  —  3)\S{k  —  l,n  —  3)  ways.  Thus  the  total  number  of  strings 
matching  our  requirements  is 


(n-2)(n-3)!5'(A;-  l,n-3). 

There  are  (n—  1)*^“^  strings  starting  at  x  and  ending  at  step  m—2.  The  prob¬ 
ability  we  step  to  the  remaining  unvisited  vertex  on  step  m  is  still  l/(n  —  1). 
So  the  probability  that  the  random  walk  completes  a  vertex  cover  without 
returning  to  the  starting  point  is 

/  1  N  (n-2)(n-3)!6'(fc-  l,n-3)  _  {n  -  2)\S{k  -  l,n  -  3) 
\n-l/  (n  -  (n  -  1)*= 

Prom  our  earlier  argument  we  know  that  the  probability  a  random  walk 

on  the  star  completes  a  vertex  cover  without  returning  to  the  starting  point 
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is  l/(n  —  1).  Thus  we  obtain 

^  (n-2)!5(fc-  l,n-3)  _  /  1  N 

kh-2  (n-l)*=  “U-J' 

which  leads  to  the  second  identity. 

Corollary  4.7 

S{k,n-  2)  _  1 

k>0 

This  identity  may  be  established  directly  by  remembering  that 


S{x,y)  =  yS{x  -  l,y)  +  S(x  -  l,y  -  1). 

When  applied  to  (4.2)  we  have 

1  =  V  iri-2)\S{k,n-2) 

.ir-2  - 1)^ 

°°  (n  —  2V 

=  /  _iu  [in-2)S{k-l,n-2)  +  S{k- l,n-3)] 

Thus 


~  (n-2)! 

kh'-2  -  1)'' 


S{k  —  l,n 


3)  = 


=  1- 


~  (n-2)! 


{n 


2)S{k-l,n 


2) 
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1  - 

1  - 

1  - 

1  - 


n 


n  —  1 


E 


(n-2)! 


n 


k=n—2 
oo 


k^-l  («-l) 
n-2\  ^  {n-2)\  . 


n 
n  —  2 
n  —  1 


1 


n  —  1 


4.3  A  Global  Upper  Bound  for  Trees 

In  Section  2.2  we  established  an  upper  bound  on  edge  cover  time  of  0{dB^). 
In  Chapter  3  we  proved  that  while  this  bound  would  provide  0{2n^)  on 
Pn,  in  reality  the  edge  cover  time  is  bounded  by  (5/4)n^  +  0(n^/logn).  In 
this  section  we  establish  an  upper  bound  specific  to  trees  which  improves  on 
0{dBe)  in  many  cases. 

Inspiration  for  the  following  theorem  comes  from  a  frequently  used  prop¬ 
erty  of  hitting  time  known  as  the  Essential  Ed(je  Lemma.  It  makes  com¬ 
putation  of  the  hitting  time  between  two  vertices  connected  by  a  unique 
path  relatively  straightforward.  The  form  below  is  due  to  Brightwell  and 
Winkler  [7]. 

Lemma  4.8  Let  vertices  x  and  y  he  at  distance  k  in  a  graph  G,  with  a  unique 
path  X  =  vq,vi,  . . .  ,Vk  =  y  between  them.  For  each  i,0  <  i  <  k,  let  Gi  he  the 
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component  of  G  —  Vi-i  —  t/j+i  that  contains  the  point  Vi;  similarly  Go  will  be 
the  component  of  G  —  Vi  containing  x.  Let  mi  be  the  number  of  edges  in  Gi. 
Then  the  expected  hitting  time  ExHy  is  equal  to 

k-l 

kf  +  2^2  ‘iTiiik  —  i). 

t=0 

Of  course,  in  a  tree  all  vertices  are  connected  by  a  unique  path.  Thus 
we  wondered  if  this  property  could  be  exploited  in  computation  of  the  edge 
cover  time  as  well. 

Theorem  4.9  Given  a  tree,  T, 

C^<Clf  +  Ht.  (4.3) 


Proof 

Let  a;  be  a  vertex  in  T  for  which  E^G^  is  maximized.  Consider  T  to  be 
rooted  at  x.  Consider  the  forest  created  when  x  is  deleted  from  T  and  call 
each  of  the  subtrees  of  T  in  this  forest  a  branch  of  T.  Then  an  edge  cover 
walk  on  T  must  proceed  from  the  starting  point  x  and  first  accomplish  a 
vertex  cover  of  T.  While  accomplishing  this  vertex  cover  the  walk  must  visit 
each  of  the  leaves  in  each  of  the  branches  of  T.  The  vertex  cover  walk  is 
completed  at  a  leaf  in  some  branch  of  T.  At  this  point  it  is  the  case  that  the 
walk  has  traversed  every  directed  edge  in  all  other  branches  of  T.  Thus  the 
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only  directed  edges  which  may  not  have  been  traversed  are  the  “back”  edges 
on  the  unique  path  from  the  leaf  at  which  the  vertex  cover  was  completed 
to  the  root  x.  Thus  the  expected  number  of  steps  remaining  to  complete  an 
edge  cover  is  no  more  than  the  hitting  time  from  this  leaf  to  x  providing  the 
desired  result.  ■ 

This  result  gives  an  immediate  bound  for  the  edge  cover  time  of  balanced 
k-ary  trees  on  n  vertices.  Zuckerman[26]  proves  that  the  vertex  cover  time 
for  such  trees  is  0(nlog^n).  Aldous[2]  points  out  that  using  the  Essential 
Edge  Lemma  one  may  prove  that  the  maximum  hitting  time  in  such  trees  is 
O(Tilogn). 

Corollary  4.10  The  edge  cover  time  for  a  balanced  k-ary  tree  on  n  vertices 
is  0(nlog^  n). 

Combining  the  results  of  Zuckerman  and  Aldous  with  Theorem  4.9  pro¬ 
vides  the  proof  of  this  corollary. 

Thus  we  have  another  example  of  a  tree  in  which  the  expected  additional 
number  of  steps  needed,  after  completing  a  vertex  cover,  in  order  to  assure 
an  edge  cover  is  of  lower  order  than  the  vertex  cover  time.  Note  however, 
that  applying  this  result  to  P„  would  not  have  provided  the  correct  result 
since  the  vertex  cover  time  and  the  maximum  hitting  time  on  the  path  are 
both  Q{v?). 


70 


When  one  considers  Brightwell  and  Winkler’s  conjecture  that  vertex  cover 
time  for  trees  is  maximized  on  the  path,  in  conjunction  with  Theorem  4.9, 
Corollary  4.10  and  what  we’ve  established  for  the  edge  cover  time  on  the 
path,  it  seems  reasonable  to  believe  that  edge  cover  time  is  maximized  for 
trees  by  a  walk  started  from  the  center  of  a  path.  This  is  certainly  one 
question  demanding  further  study. 
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CHAPTER  V 


A  TIGHT  BOUND  ON  THE  EDGE 
COVER  TIME  OF  THE  CYCLE 

5.1  Vertex  Cover  Time  on  Cn 

A  cycle  on  n  vertices,  is  a  connected,  regular,  simple  graph  in  which  each 
vertex  has  degree  2.  We  briefly  mentioned  in  Section  1.2  that  the  vertex  cover 
time  for  Cn  is  ^2)  •  One  classic  argument  establishing  this  is  a  combination  of 
a  coupon  collectors  approach  with  the  recursive  argument  used  to  establish 
the  hitting  times  on  the  path.  The  outline  here  parallels  Wilf[23]. 

Consider  the  point  in  the  random  walk  on  Cn  at  which  the  new  vertex 
has  first  been  visited.  Then  the  walk  has  visited  j  contiguous  vertices  which 
form  a  path  and  is  resting  on  the  endpoint  of  the  path.  How  long  will  it 
take,  on  average,  to  visit  the  (j  +  vertex?  We  may  consider  the  walk  to 
be  on  a  path  with  j  +  2  vertices  having  covered  the  vertices  from  2  to  jf  +  1 
and  resting  at  vertex  j  +  1-  We  then  want  to  know:  how  long,  on  average. 
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before  the  walk  visits  vertex  1  or  vertex  j  +  2,  whichever  occurs  first?  If  we 
let  Cr  represent  the  expected  number  of  steps,  starting  at  vertex  r  to  first 
visit  either  vertex  1  or  jf  +  2  on  such  a  path,  then  Ci  =  0^+2  =  0  and  for 
1  <  A;  <  jf  +  2, 

~  (2)  +  ^)  +  (2)  ^ 

The  solution  to  this  recurrence  relation  is  cv  =  r  —  +  (r  —  l)(j  +  2).  Which 

provides  Cj+i  =  j. 

Now  the  vertex  cover  time  is  easily  computed. 

Due  to  the  symmetry  of  the  cycle  this,  as  well  as  all  edge  cover  times,  is 
independent  of  the  starting  vertex.  From  now  on  we  will  assume  the  cycle  is 
labeled  in  the  natural  way  and  that  all  walks  begin  at  vertex  1. 

5.2  The  Undirected  Edge  Cover  Time  of  Cn 

Path  results  may  also  be  used  to  compute  the  edge  cover  times  for  cycles.  In 
the  case  of  the  undirected  edge  cover  time  we  do  nothing  more  than  extend 
the  connection  above.  In  the  directed  case  we  need  to  use  the  more  delicate 
results  established  in  Chapter  3.  As  we  have  done  in  the  previous  cases  we 
will  calculate  the  edge  cover  time  by  first  finding  the  vertex  cover  time  and 
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then  analyzing  the  expected  number  of  additional  steps  needed  to  complete 
an  edge  cover.  In  Chapter  3  we  pointed  out  that  the  undirected  edge  cover 
time  of  a  tree  is  the  same  as  the  vertex  cover  time.  This  is  not  the  case  for 
the  cycle. 

Theorem  5.1  The  undirected  edge  cover  time  for  Cn  is 

("') 

Proof 

Upon  completing  a  vertex  cover  of  Cn,  say  at  vertex  j,  the  only  undirected 
edge  not  yet  traversed  is  either  {jf ,  Jf  + 1 }  or  {j,  jf  —  1 } .  The  expected  number  of 
additional  steps  needed  in  order  to  traverse  this  last  edge  may  be  calculated 
by  considering  a  walk  on  a  path  containing  n  +  2  vertices.  The  walk  on 
the  cycle  may  be  considered  to  have  been  a  walk  which  has  visited  vertices 
{2, 3, . . . ,  n,  n  +  1}  of  this  path  and  which  is  now  located  at  vertex  rr  +  1. 
Then  if  we  compute  the  expected  amount  of  time  necessary  for  such  a  walk 
to  either  visit  vertex  1  or  vertex  n  +  2,  whichever  occurs  first,  then  this  will 
be  the  expected  number  of  steps  necessary  to  traverse  the  remaining  edge  on 
the  cycle.  The  argument  above  shows  that  it  takes  n  steps  to  reach  either 
vertex  1  or  vertex  n  +  2  of  the  path.  Combining  this  with  the  vertex  cover 
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time  for  Cn  we  have 

C--  =  C^’+n=  (2)  +''=  ^2  ^)'  " 

5.3  The  Directed  Edge  Cover  Time  for  Cn 

Lovasz  and  Winkler[19]  point  out  that  it  is  “folklore”  that  the  probability 
a  particular  vertex  j  1  is  the  last  new  vertex  visited  by  a  random  walk 
started  at  1  on  Cn  is  l/(n  —  1).  The  key  to  analyzing  the  directed  edge  cover 
time  is  to  realize  that  upon  completing  the  vertex  cover  portion  of  the  walk 
the  sequence  of  vertices  visited  may  be  mapped  to  a  sequence  describing  a 
vertex  cover  of  Pn  with  the  starting  point  determined  by  the  vertex  at  which 
the  walk  completes  a  vertex  cover  of  Cn-  Using  this  fact  we  will  be  able  to 
relate  the  probability  distribution  for  untraversed  edges  on  the  path  to  that 
of  edges  on  the  cycle.  We  will  then  use  this  to  determine  lower  and  upper 
bounds  on  the  expected  number  of  additional  steps  needed  to  complete  an 
edge  cover  after  all  the  vertices  have  been  visited  at  least  once. 

Theorem  5.2  The  directed  edge  cover  time  for  a  simple  random  walk  on  Cn 
is 


Proof 
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1 


Figure  5.1:  Vertex  cover  walk  of  Cg 

Suppose  we  label  P„  and  (7„  in  the  natural  way.  Also  suppose  that  we  take 
a  walk  on  Cn  and  it  completes  a  vertex  cover  at  vertex  2  <  j  <  n  having 
stepped  to  j  from  vertex  j  —  1.  The  probability  that  this  occurs  may  be 
determined  by  a  simple  probability  tree  analysis  which  yields  (j  —  l)/n{n—l). 

If  we  map  vertex  {j  +  k)modn  on  Cn  to  vertex  k  on  Pn,  so  that  1  on  (7„ 
maps  to  n  —  J  +  1  on  and  j  on  Cn  maps  to  n  on  P„,  then  the  sequence  of 
vertices  visited  on  will  correspond  to  a  vertex  cover  walk  on  P„  started 
at  vertex  n  —  j  +  1  and  ending  at  vertex  n.  A  vertex  cover  walk  on  Cg 
is  illustrated  in  figure  5.1,  and  the  mapping  to  a  vertex  cover  walk  on  Pg  is 
illustrated  in  figure  5.2.  As  a  matter  of  fact  this  mapping  describes  a  bijection 
between  vertex  cover  walks  starting  at  vertex  1  and  ending  at  j  on  Cn  and 
vertex  cover  walks  on  P„  starting  at  vertex  n  —  j  +  1  and  ending  at  vertex  n. 
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Figure  5.2:  Vertex  cover  walk  from  figure  5.1  mapped  to  vertex  cover  walk 
of  Ps  ending  at  8 


Now  consider  how  this  walk  on  Cn  must  have  visited  the  vertices  in  the 
cycle.  In  order  to  end  a  vertex  cover  at  j  stepping  to  it  from  j  —  1  the  walk 
must  have  traveled  back  and  forth  to  either  side  of  the  starting  vertex  until 
it  visited  vertex  j  +  1.  It  then  traveled  back  and  forth,  never  proceeding 
beyond  j  +  1  until  it  finally  visited  j  for  the  first  time  from  j  —  1.  This  means 
that  all  edges  between  vertex  j  +  1  and  vertex  1  have  been  traversed  in  both 
directions  as  illustrated  by  the  walk  in  figure  5.1.  Thus  the  only  possible 
edges  not  traversed  during  the  vertex  cover  portion  of  the  walk  are  directed 
edges  (j,  j  +  1),  {j  +  l,j),  {j,j  -  1),  ... ,  (2, 1).  Given  the  bijection  between 
walks  on  the  cycle  and  walks  on  the  path  described  above  we  can  describe 
the  probability  that  t  is  the  least  integer  such  that  edge  —  is  the  closest 
edge  to  vertex  1  not  traversed  during  the  vertex  cover  portion  of  the  walk  on 
Cn,  ioT  2  <  t  <  j  —  1.  Using  the  notation  of  Section  3.4  this  probability  is 
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If  edge  {t,  t  —  1)  is  the  edge  closest  to  vertex  1  not  traversed  so  far,  then 
we  may  obtain  an  upper  bound  for  the  expected  number  of  steps  remaining 
to  complete  an  edge  cover  by  the  expected  amount  of  time  to  traverse  the 
edges  {j,j  +  1),  {j  +  l,j),  ij,j  -  1),  {j  -  l,j  -  2),  . . . ,  (t  +  l,t),  {t,t-  1)  in 
this  particular  order.  Once  again  a  recursive  argument  may  be  used  to  show 
that  the  expected  number  of  steps  needed  to  traverse  edge  (s,  s  +  1)  starting 
at  s  on  On  is  n  +  1 .  It  follows  that  the  expected  number  of  steps  to  traverse 
these  edges  in  the  specified  order  is  (jf  —  t  +  3)(n  +  1). 

Thus,  given  that  edge  <t<  j  —  1,  is  the  closest  edge  to  vertex  1 

not  yet  traversed,  an  upper  bound  on  the  expected  number  of  remaining  steps 
is 

0  -  t  +  3)(n  +  I  .4ir“+.).  (5.1) 

If  the  closest  edge  to  vertex  1  not  traversed  was  (j,  j— 1),  then  an  upper  bound 
on  the  expected  number  of  steps  remaining  is  3(n  +  1)  and  the  probability 
that  this  occurs  is  less  than  1. 

Putting  this  together  with  the  probability  that  a  walk  does  end  a  vertex 
cover  at  j  by  stepping  from  j  —  1,  the  additional  number  of  steps  needed  is 
bounded  above  by 
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E 

J=2 


7-1 


^n(n  —  1) 

3-1 


S(n  +  1) 


+  53  (i  ~  ^  +  3)(n  +  I  ^;^r7+i) 

t=2 


(5.2) 


Remembering  the  symmetry  of  Cn,  this  also  describes  a  bound  on  the 
additional  number  of  steps  needed  for  walks  ending  a  vertex  cover  at  j  by 
stepping  from  j+1.  Thus  an  upper  bound  on  the  expected  additional  number 
of  steps  needed  to  complete  an  edge  cover,  having  visited  all  vertices  at  least 
3,  is  twice  the  quantity  in  (5.2).  Denote  this  value  by  ho¬ 
using  equation(3.23)  yields 

^  ^  3(n  +  1)  +  ^  (jf  -  t  +  3)(?t.  +  1)  X 

^rn-j+t  I  Al->n-j+t\ 

j-\-l,n-j+t  I  ^n-j+1  ) 


once, 


EO-1) 


t-l 


+  2  (E±l^  ( _ 

n  )  -  t  +  I )  \n  -  j  -\- 1  -  \ ^ 


X 


<  3(n+l)  +  4 


j=2 t=2 


n  +  1 


n 


P(X, 


n-j+t  I  A  l^n-j+t-, 

-j+l,n-j+t  I  -^n-j+1  ) 


t-  1 


n  j—1 

rn~j+t 


P(  Y"“7+'  A 


(5.3) 


Note  that  since  the  first  term  in  this  sum  is  o(n^/ log  n)  we  need  only 
prove  that  the  second  term  is  0(n^/ log n).  Also  note  that  in  the  sum  in 
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this  inequality  the  term  with  jf  =  2  is  zero,  so  the  sum  for  which  we  desire  a 
bound  is 


5  =  4 


/n+  1\  j 

^  i-1  \ 

\n-  j  +  t-l) 

Using  equation(3.26)  (5.4)  becomes 

„  I  1 X  n  j-i t-i 

s  =  (fr-/r-l) 
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t  —  r 
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i=3t=2 r=l 
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i-1 


=  4r^)EE(/r-/-.)  E 

j=3r=l  t=r+l 


n  —  j  +  t  —  rj 

f  ^  ^ 

\n-j  +  t-r^ 


jf— 2  j-r-l 


5  '• lUn) 
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i=3 

jf-3  j-r-2 

n  i-2 


\  n-r- 

I  s  * 


=  4i^)E  (- 

n  J  \n  —  r 


fr 


n—r—1 


r=\ 

n—2 


Jk=l 


2f^^)  E  («-0/r 

'  r=l 


n 


<  2l'^)E(n-r)/,. 


But  by  definition  (3.8)  this  gives 


5  =  2 


(T)- 


(5.5) 


Thus 


<  3(n 


n  +  1\ 

]  f/n— 1) 
n  J 
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and  in  Lemma  3.4  we  have  shown  that  g-n-i  =  0(n^/logn).  Therefore  the 
expected  number  of  steps  needed  to  complete  an  edge  cover  after  first  visiting 
all  the  vertices  on  Cn  is  0(n^/iogn). 

This  upper  bound  was  shown  based  on  traversing  the  remaining  directed 
edges  in  a  specific  order.  We  may  obtain  a  lower  bound  on  the  additional 
number  of  steps  needed  by  determining  the  edge  closest  to  vertex  1  which  has 
not  been  traversed,  and  then  computing  the  hitting  time  from  our  present 
position  back  to  traverse  that  edge.  Simple  recursive  arguments  show  that 
the  hitting  time  from  vertex  s  to  traverse  edge  {t,t  —  1),2  <  t  <  s  —  1  is 
(s  —  t  +  l)(n  —  s  +  t+1)  steps.  Thus  a  lower  bound  on  the  expected  additional 
number  of  steps  to  complete  an  edge  cover  is 

(yrTTl)  I  (5.6) 


Once  again  using  equation (3. 26),  and  simplifying,  (5.6)  becomes 


>  SEE 

j=3  t=2 


n 


)  E  (/.  -  fr-l) 

'  r=l 


t  —  r 


n  -  j  t 


> 


o  n  J  — 1  t—l 

j=3t=2  r=l 
n  jf  — 1  f— 1 

^)eee^ 

j=3t=2  r=l 
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Using  Corollary  A. 5  we  then  have 


Thus  we  have  the  desired  result  that  An  =  ©(n^/logn)  proving  that 
^Cn  =  (2)  +0(^Vlogrr). 
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CHAPTER  VI 


QUESTIONS  FOR  FURTHER 

STUDY 


We  established  that  Sn  minimizes  edge  cover  time  for  trees.  Which  class  of 
graphs  minimizes  edge  cover  time  for  all  graphs?  Which  maximizes  it? 

Our  results  for  edge  cover  time  on  trees  seem  to  suggest  that,  for  fixed  n, 
the  more  leaves  a  tree  contains  the  smaller  the  number  of  steps  needed  to 
complete  a  vertex  cover  walk  to  an  edge  cover  walk.  If  one  could  prove 
Brightwell  and  Winkler’s  conjecture  that  vertex  cover  time  for  trees  is  max¬ 
imized  by  the  path,  then  this  observation  would  seem  to  indicate  the  same 
could  be  proven  for  edge  cover  time. 

Brightwell  and  Winkler[6]  established  that  the  maximum  hitting  time  for 
any  graph  on  n  vertices  is  obtained  by  a  walk  on  the  “lollipop”  and  has 
order  (4/27)n^  +  o(n^).  Feige  went  on  to  prove  that  the  vertex  cover  time 
for  any  graph  on  n  vertices  is  (4/27)rt^  +  which  makes  the  lollipop 

graph  a  candidate  for  the  graph  which  maximizes  vertex  cover  time.  Since 
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Zuckerman’s  bound  proves  that  edge  cover  time  is  0{n^)  it  seems  plausible 
that  the  “lollipop”  may  maximize  edge  cover  time  as  well.  Is  there  a  way  to 
prove  this?  Simulations  on  this  graph,  for  small  n,  seem  to  indicate  that  the 
expected  additional  number  of  steps  needed  to  complete  a  vertex  cover  to  an 
edge  cover  is  0{n^).  Can  this  be  proved,  perhaps  using  methods  similar  to 
those  used  to  establish  our  global  upper  bound? 

Bounds  for  vertex  cover  time  tighter  than  those  developed  by  Matthews 
and  Aleliunas  have  been  established  based  on  knowing  the  maximum  degree, 
average  degree,  or  that  the  graph  is  regular.  Can  improved  bounds  be  derived 
for  edge  cover  time  based  on  these  parameters? 

The  blanket  time  provides  a  measure  of  when  the  walk  has  “distributed” 
itself  evenly  amongst  the  vertices  of  the  graph.  If  we  define  a  directed  and 
undirected  “edge  blanket  time”  in  a  similar  manner,  can  we  establish  bounds 
on  it? 

Winkler  and  Zuckerman[25]  use  ad  hoc  methods  to  prove  that  the  blanket 
time  for  the  path  and  cycle  is  0{C'’).  Their  results  only  provide  a  bound  on 
the  constant  of  Intuitively  it  seems  to  us  that  the  constant  in  both  cases 
should  be  small.  This  is  because  a  walk  on  the  path  started  at  an  endpoint 
spends  much  of  its  time  in  the  “first  half”  of  the  path  and  very  little  in  the 
“second  half”.  Thus  if  a  walk  is  allowed  to  complete  say  4  vertex  covers 
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then  it  seems  likely  that  it  has  distributed  itself  evenly  over  both  ends  of  the 
path  and  the  only  possible  area  of  difficulty  would  be  the  number  of  visits  to 
the  vertices  in  the  center.  However,  computer  simulations,  using  code  which 
has  proved  very  accurate  in  our  study  of  vertex  and  edge  cover  time,  shows 
the  ratio  slowly  growing  from  approximately  2.4  for  a  path  of  10 

vertices  to  4.9  for  a  path  on  100  vertices.  In  the  case  of  a  cycle  the  ratio 
grows  from  2.7  for  a  cycle  on  5  vertices  to  6.2  for  a  cycle  on  200  vertices.  The 
simulations  do  not  give  any  evidence  that  this  growth  is  ending.  If  the  results 
of  Winkler  and  Zuckerman  are  correct,  then  what  is  the  correct  constant?  Is 
it  possible  that  there  is  no  constant  and  in  fact  the  ratio  is  unbounded? 

We  have  established  tight  bounds  on  the  edge  cover  time  for  a  number 
of  trees  and  the  cycle.  Aldous’  and  Zuckerman’s  results  combine  to  provide 
a  tight  bound  for  the  clique.  Can  our  techniques  be  used  to  provide  tight 
bounds  on  other  classes  of  graphs? 

Many  of  the  results  for  vertex  cover  time  have  been  generalized  to  cases 
in  which  each  edge  is  given  a  weight  and  the  probability  of  traversing  any 
particular  edge  out  of  a  vertex  is  not  uniform.  Is  there  a  way  to  do  this  in 
the  edge  cover  time  case  applying  our  results? 
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APPENDIX  A 


PROPERTIES  OF  {/„} 


In  Sections  3. 1-3.3  we  considered  random  walks  on  Pn  started  from  an  end¬ 
point.  We  assumed  the  path  was  labeled  in  the  natural  way  with  the  start 
point  being  vertex  1  and  the  other  endpoint  being  vertex  n.  For  each  edge 
{k,  k  —  1) ,  2  <  k  <  n,  we  determined  that  the  probability  the  edge  was  not 
traversed  prior  to  the  walk  completing  a  vertex  cover  was  l/(n  —  k  +  1). 
Given  that  edge  {k,k  —  1)  was  not  traversed  prior  to  the  completion  of  the 
vertex  cover,  we  then  calculated  the  probability  that  it  was  the  first  such 
edge  and  labeled  that  probability  fk-i-  In  this  section  we  collect  a  number 
of  facts  about  the  sequence  {/„}  which  have  proved  useful  in  our  analysis  of 
the  edge  cover  time  of  random  walks  on  graphs. 

After  defining  the  /„  in  Chapter  3  we  noted  that  they  fulfilled  identity  (3. 5). 
This  gave  us  a  recursive  method  for  computation  of  {/n},  and  made  it  clear 
that  this  is  a  sequence  of  rational  numbers.  Using  Mathematica  it  was  an  easy 
task  to  compute  the  values  of  the  sequence  up  to  /rso-  Table(A.l)  contains 
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/2 

fs 

/4 

h 

fc, 

h 

h 

fo 

/lO 

1 

1 

5 

3 

251 

95 

19087 

5257 

1070017 

25713 

2 

12 

8 

720 

288 

C0480 

17280 

3628800 

89600 

Table  A.l:  Exact  values  of  fi  —  fio 


the  values  for  /«,  1  <  w  <  10. 

A  second  identity,  in  this  case  involving  a  weighted  sum  of  the  harmonic 
numbers,  may  be  derived  directly  from  identity (3. 5).  Specifically, 

1 


n  ] 


EE  73T^T  A  =  E  1. 

7=lfc=l  \J  ,=  1 


n  /  1 


=  n, 


fkhn-k  =  n.  (A.l) 

*=1 

The  connection  between  the  elements  of  {/„}  and  the  harmonic  numbers  is 
pervasive. 

The  values  in  Table  A.l  clearly  suggest  that  the  /„  form  a  decreasing 
sequence.  This  turns  out  to  be  the  case. 


Lemma  A.l  The  sequence,  {/„},  is  a  strictly  decreasing  secpience. 


Proof 

Using  identity (3.5)  we  have  for  n  >  1, 


E 


fc=i 


ITt) 


(  ^ 

\n  —  k  +  2 


0. 
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So 


n  +  1 


Writing  Sn  =  fn-  fn+i  yields 


Then  for  n  >  2 


n  +  1 


Sn 


< 


n  —  I' 


n 


)ti^. 


Sk- 


Replacing  the  sum  by  (A. 3)  yields 


(A.2) 


(A.3) 


1  n  —  1 


n  +  1 


n‘‘ 


Thus  Vn  >  2, 


1 


>  0. 


ri?{n  +  1) 

Combining  this  with  the  fact  that  Si  =  1/2  we  have  Vn  >  1, 


(A.4) 


fn  —  fn+1  >  0. 


Using  the  fact  that  the  sequence  is  strictly  decreasing  we  can  determine 
an  upper  bound  on  fn- 
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Lemma  A. 2  For  all  n>  2, 


(A.5) 


Proof 

Using  inequality  (A.  4)  and  iterating  we  have 

1 


(1  —  fn)  >  (1  —  fn~l)  + 


n 


(n  -  1)2 


(1  ~  fn)  >  (1  --  fn-f)  + 


1 


+ 


1 


n(n  —  1)2  (n  —  l)(n  —  2)2 


n— 2 


1 


(1  fn)  >  (1  /l)  +  E  1)2- 


It  follows  that 


k=2 


t  fUd-/-)  >  t  fU(d-/-‘+-)  +  E 


k=2 


k, 


f^o  (^- -  1)^ 


(hn  -  1)  -  (1  -  fn) 


+  Y.  -  hj+i)  ( 

J=0  \ 


{n  -  j){n  -  j  -  \)'^ )  ' 


This  then  implies  that 


n— 2  / 

~k>nfn  ^  1  T  ^  ^  {f^n  ^U+l)  I 

7=0  \ 


(n  -  j)(n  -  j  ~  l)y  ■ 


Multiplying  this  inequality  by  -l//i„  and  discarding  all  but  the  last  term  in 
the  sum  on  the  righthand  side  we  determine  that 


It  is  worth  noting  that  one  can  improve  on  this  bound  slightly  by  using 
equation(3.13).  If  one  integrates  this  quantity  using  bounds  similar  to  those 
in  the  argument  at  (3.16)  one  obtains  the  bound 


fn+l  < 


-1- 

hn 


Va  >  1. 


It  appears  that  establishing  a  lower  bound  for  /„  is  not  quite  so  easy. 


Lemma  A. 3 

/„  =  (!-  0(1))  (i)  .  (A.6) 


To  facilitate  obtaining  such  a  bound  we  introduce  a  new  quantity.  Define 


Qn  by 


(A.7) 


Claim 


{(]n  -  l)hn  =  o(l). 


Proof 


Qnhn  firi 


^  \n  -  k  +  l)  hk 
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n  —  k  +  I 


hn 
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^)( 


Tl— 1 

E 

k=l 


hfi  hk 

hk 


1 


n  —  k  +  1 


hn  hk 
.  hk  . 


(A.8) 


Since  /i„/n  =  o(l)  it  suffices  to  show  that  the  sum  is  bounded  above  by  a 
function  that  approaches  0  as  n  grows.  We  will  do  this  by  first  splitting  this 
sum  into  three  parts.  For  simplicity  of  notation  let  N  =  -j^J.  Since  these 
quantities  contain  the  harmonic  numbers  we  will  make  frequent  use  of  the 
inequalities 

Inn  +  7  <  /in  <  Inn  +  7  + 

2n 

where  7  =  limn— 00 (/^n  —  hm)  is  Euler’s  constant. 

(A.)  I 


There  are  less  than  terms  in  this  sum.  For  large  n,  and  in  the  range 
2<k<N, 
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n  —  A;  +  1 


1 

hk 
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1 


n 


and 


Thus 


(/in  -  hk)  <  {hn  -  -). 


N 

E 

fc=2 


1 


n  —  k  +  I' 


hn  hk 
hk 


< 


< 


n 


In^n/  \n 

^  =0{1). 


‘ -  5) 


Inn 


91 


(B.) 


fc=iV+l 


In  this  sum 


hn  h/c 

/ifc 


^  j  hn  h]<l|-^-l  \ 

~  V  h^+i  J 
^  nnn  +  ^-\n  (A^+1) 
-  V  ln(Ar+l) 

<  /  ^  +  21nlnn  \ 

“  I  In  n  —  2  Inin  n  )  ' 


(A.9) 


n—N  —  l 


k=N+l 


n  —  k+\ 


<  hn-N  —  /iA?+2 

<  In  (n  —  A^)  + 


+  /  1  N  -In(iV  +  l) 

2"  (l  - 


1 _ ,  /  n 

^  2n  (l  -  ■  "''In"" 


=  ln(l  +  i-^) 

\  n  In  n/ 

H - 7 - ^  +  2  In  Inn 

2"  (i  -  iA:) 

<  - 7 — ^  +  2  In  Inn. 

2"  (i  -  iA;) 


(A.IO) 
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Combining  inequalities(A.9)  and  (A.  10)  we  have 


n—N—l 

E 

k=N+l 


1 


n  —  k  +  1. 


hfi  hk 

hk 

^  /  ^  +  21nlnn\"-A"^ 


)S.  (^) 
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Here  we  have 


hn  hk 
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However  ln(a;)  <  x  —  1.  So 

hn  hk\  ^  2n  hi’-^n— 1 


/V  I  ^  _ £76 _ Hi  11— 1. _ 

hk  y  -  lnn  +  ln(l-i^) 


Thus 


71—1 

E 

k=n-N 


1 


n  —  k+  1. 


hn  hk 
.  hk  , 
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tablished  that  Vn  >  2,/„  <  (1  —  l/2n)(l//j„).  From  identity(3.5)  we  know 
that 

~  [(5)  +  G)  + (d^i)  (^)  ■ 


Thus  Vn  >  3 


Then  subtracting  1  //i„  from  both  sides  of  this  inequality  yields 


which  leads  to  the  conclusion  that 

fn  ^1  I,  ■ 

Using  the  conclusion  of  the  claim  proved  above  we  have  the  desired  result 

/„=  (l-o(l))^.  ■ 

n— 1 

In  Section  3.3  we  needed  to  bound  the  quantity  (7„  =  Yj  —  Using 

j=i 

results  from  Jordan[17]  we  proved  an  upper  bound  of  0{n^ / \ogn).  We  also 
claimed  that  it  is  possible  to  lower  bound  the  quantity  by  /  logn).  Below 

is  the  proof  of  that  claim. 


Lemma  A.4 


On 


(W  -  j)fj  =  ^ 


3=1 


(A.ll) 
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Proof 


The  proof  will  be  by  induction  on  the  “forward  differences”  of  /^Qn-x  = 

n 

Qn  —  Qn-i  —  Yl  fk  denotes  the  first  forward  difference  of  gn  and  = 

k=0 

A{Agn-2)  fn  the  second  forward  difference.  As  a  result  of  Lemma  A. 3  we 
know  that  fn  >  (l/2)(l/h„),Vn  >  1.  Claim  Agn-i  >  (l/2)(n//i„). 


Agi  =  fi  +  f2 


3 

2 


Now  suppose  the  claim  is  true  Vj  <  n. 

^ gn—2  —  Agn—1  Agn—2  fn 


implies  that 


Agn~x  —  Agn—2  T  fn- 


But  by  the  inductive  hypothesis 


Agn-2  +  /n  ^ 


> 


n  —  1^/1\  1 

hn —  1  \  2  /  hn 

{n  -  1)^  +  (n  -  l)hn-i  +  hn-x 

hjihn—X 

n 

hn 
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Now  that  we  know  A^'n-i  >  (|)  ^  the  rest  of  the  induction  is  straight¬ 
forward. 


51  =  1  >  4- 


Suppose  Qj  >  (i)  <  n.  Then 


On  9n-l  -  ^On-l  >  Q) 


implies 


T\  n 
hn 


On  >  Qn-l  +  f-) 

V2;  K 


4)  K  . 

1^  (n  -  1)^^  +  {n-  l)^hn-i  +  2nhn-i 
'l\n^  +  l  /l^n^ 


hnhn~l 

(-)- 
V4y  hn 


Thus  Qn  =  n(n^/logn),  as  desired. 


In  the  course  of  the  proof  above  we  established  that  /^g-n-i  =  ^  fk  ^ 

k=0 

(l/2)(n/h„).  Using  an  argument  similar  to  that  of  Lemma  3.4  one  can  prove 
that  Agn  =  0{n/\ogn).  Thus  an  immediate  corollary  to  Lemma  A. 4  is 


Corollary  A. 5 


n 


E  A  =  ^  , 

^  Vlogn, 
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